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@ Artificial gene coding for authentic human serum albumin, use thereof, and method of producing the same. 

A structural gene coding for authentic human serum 
[bumin, - optionally supplemented by an upstream triplet 
coding for methionine and optionally extended by a synthetic 
prepro* -leader-coding sequence - , wherein the codons of the 
nucleotide sequence have been selected with regard to a 
non-human host, e.g. yeast, chosen for expression of authentic 
human serum albumin, is disclosed. 

Additionally there is disclosed a method of producing said 
gene. 

There are also disclosed a recombinant DNA molecule 
comprising said strucural gene inserted into a vector, and a 
host transformed with said recombinant DNA molecule. 

Furthermore there are disclosed a method of producing 
authentic human serum albumin, an authentic human serum 
albumin resulting from said method, and a pharmaceutical 
composition comprising said resulting human serum albumin. 
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Description 

ARTIFICIAL GENE CODING FOR AUTHENTIC HUMAN SERUM ALBUMIN, USE THEREOF, AND METHOD OF 

PRODUCING THE SAME 

The present invention is directed to a structural gene coding for authentic human serum albumin - optionally 
5 supplemented by an upstream triplet coding for methionine and optionally extended by a synthetic 
prepro*-leader-coding sequence - . to a recombinant DNA molecule comprising said gene inserted into a 
vector, to a host transformed with said DNA molecule, to a method of producing authentic human serum 
albumin, to an authentic human serum albumin and to a pharmaceutical composition comprising authentic 
human serum albumin. The invention is additionally directed to a method of producing a structural gene coding 
10 for authentic human serum albumin. 

Background 

Serum albumin is the major protein component of serum in higher species. Its role is in maintaining osmotic 
75 balance and it is involved in the binding and transport of sparingly soluble metabolic products from one tissue 
to another, especially in the transport of free fatty acids. Human serum albumin is used in therapy for the 
treatment of hypovolemia, shock and hypoalbuminemia. It is also used as an additive in perfusion liquid for 
extracorporeal circulation. Furthermore, human serum albumin is frequently used as experimental antigen. 

Human serum albumin is composed of a single long polypeptide chain comprising nearly 600 amino acid 
20 residues. The amino acid sequence thereof is published. (See e.g. [Lawn R.M., et. al., Nucleic Acids Research, 
Vol. 9, No. 22 (1981) pp. 6103-61 13)]. Commercial human serum albumin is prepared from human plasma. The 
availability of human plasma is limited. 

Careful heat treatment of the product prepared from human plasma must be effected to avoid potential 
contamination of the product by hepatitis B virus and HIV virus. 
25 Since one of the characteristics of HIV virus is to frequently change its antigenic structure, there are no 
guarantees that it will not develop heat resistant variants. 

Obviously, there is a need for artificial authentic human serum albumin that can be produced in unlimited 
quantities. 

30 Prior art 

Several attempts to produce products corresponding to mature human serum albumin by using 
recombinant DNA techniques have been made and published i a. in the following patent applications. 

EP-A-0 073 646 (Genentech Inc), EP-A-0 079 739 (The Upjohn Co.), EP-A-0 091 527 (President and Fellows 
35 of Harvard College), and EP-A-0 198 745 (Genetica). 

All of the above mentioned patent applications have started from isolation of mRNA from human liver, and 
this mRNA has been used to prepare double-stranded cDNA (or fragments thereof). Consequently the codon 
usage in the cDNAs is by nature optimized for human expression. 

It is considered in the art that human codon usage is not ideal for non-human expression. 
40 Prior to the present invention there have not been produced such large DNA sequences as needed for 
authentic human serum albumin (structural gene = 1761 bp) in which the codons are optimized for 
non-human expression. 

In EP-A-0 182 383 (Vepex Contractor Ltd., and MTA Szegedi Biologiai Kdzpontja) is disclosed a process for 
the production of oligo- and polydeoxyribonucleotides by synthesizing the complementary strand of a single 
45 -stranded DNA piece enzymatically. This technique has been partly used in the method of producing a 
structural gene coding for authentic human serum albumin (HSA) according to the present invention, but is 
was combined with a new technique of joining a few large fragments of the gene. 

Description of the invention 

50 

The main object of this invention is to produce authentic human serum albumin with the aid of an artificial 
structural gene having a nucleotide sequence wherein the codons are optimized for non-human expression. 

To realize this object it was first necessary to design an artificial structural gene and to invent a method of 
producing said gene. 

55 

Design of the artificial structural gene 

It was decided to choose codons especially suited for yeast expression as an useful example of non-human 
expression. 

60 The codons were selected from yeast codons for highly expressed yeast proteins (Bennetzen, J.L and Hall, 
B.D. (1982) J. Biol. Chem. 257, 3026-3031. and Sharp, P.M., Tuohy, T.M.F. and Mosurski, K.R. (1986) Nucleic 
Acids Res. 14, 5125-5143). 
In the first instance, the codons most frequently used by yeast were selected, but where appropriate the 
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second or third codon was used. 

The reasons for choosing the second or third codons were a) to avoid the appearance of such restriction 
sites which are to be used during the assembly of the gene-, b) to create one unique cleavage site for a specific 
enzyme, and c) to eliminate 8-base pairs long or longer palindroms within such parts of the gene which are to 
be chemically synthesized and cloned; to avoid possible internal loops or secondary structural formations 5 
within the individual synthetic oligonucleotides. 

Artificial structural gene coding for authentic HSA 

In one aspect of the invention there is provided a structural gene coding for authentic human serum albumin. 10 
Said gene is characterized by a nucleotide sequence wherein the codons have been selected with regard to a 
non-human host chosen for expression of authentic human serum albumin, whereby the selection of the 
codons has been effected so that. 

in the first instance, the codons most frequently used by the chosen non-human host were selected, and 

in the second instance, the codons used by the chosen non-human host in the second or third place were 15 

selected, 

to avoid the appearance of such restriction sites which are to be used during the assembly of the gene, 
to create one unique cleavage site for a specific enzyme, and 

to eliminate 8-base-pairs long or longer palindromes within such parts of the gene which are to be chemically 
synthesized and cloned. 20 

In a variant of this aspect of the invention there is provided a structural gene coding for authentic human 
serum albumin plus an initial extra methionine. In this variant of the gene the nucleotide sequence starts with a 
triplet coding for methionine and the rest of the nucleotide sequence codes for human serum albumin as 
above. When this gene is expressed there is produced either authentic human serum albumin or a methionyl 
derivative thereof, depending on the expression system used. 25 

In an other variant of this aspect of the invention there is provided a structural gene coding for authentic 
human serum albumin, extended by an upstream nucleotide sequence in which the codons have been 
selected with regard to a non-human host and which codes for the amino acid sequence 
Met-Lys-Trp-Val-Thr-Phe-lle-Ser-Leu-Leu-Ph 

A "structural gene 8 is a DNA sequence which codes for a specific peptide or protein through its template or 30 
messenger RNA, and includes stop codon(s). 

A "functional gene" comprises, in addition to a structural gene, flanking sequences. Such- flanking 
sequences comprise regulatory regions, such as a promoter sequence and a transcriptional terminator 
sequence. The flanking regions should be optimized for the specific vectors and hosts used for the expression 
(and production) of the peptide or protein encoded by the structural gene. 35 

In a preferred embodiment of the invention, the structural gene coding for authentic HSA has a nucleotide 
sequence wherein the codons are selected with regard to yeast expression of authentic HSA. Even though 
only codons selected with regard to yeast expression are exemplified in the present specification, the 
teachings given herein will enable a man skilled in the art to design and construct a structural gene coding for 
authentic HSA wherein the nucleotide sequence has codons selected with regard to another non-human host, 40 
such as a bacterial host or a plant host 

The expression "authentic human serum albumin" has been used in this specification and ciaims to define 
an artificially produced protein of non-human origin having an amino acid sequence which corresponds to the 
amino acid sequence of native mature human serum albumin. 

45 

Recombinant DNA molecule 

In an other aspect of the invention there is provided a recombinant DMA molecule comprising a structural 
gene according to the invention Inserted into a vector. 

The recombinant DNA molecule thus comprises a vector into which is inserted a functional gene (including a 50 
structural gene according to the invention), wherein the flanking sequences are adapted for the vector, and the 
host to be used. 

Commonly used vectors are plasmids from bacteria, especially E. coir, and bacteriophages, e.g. lambda 
phage. 

Specific examples of this aspect of the invention are disclosed in the part of this specification describing 55 
preferred embodiments of the invention. 

Transformed host 

In still another aspect of the invention there is provided a host transformed with a recombinant DNA 60 
molecule according to the invention. 

Even though the codons of the nucleotide sequence in the structural gene (in a preferred embodiment of 
the invention) are selected with regard to a yeast host, yeast strains are not the only hosts which can be used. 
The structural gene designed for yeast expression may also be suited for bacterial or plant expression. Thus 
the host can be a yeast cell, e.g. Saccharomyees cerevisiae, a bacterial cell, such as E. coll or Bacillus subtilis, 65 
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or a cell of a plant, such as bean plants, pea plants or tobacco plants. 

Method of producing the artificial structural gene 

5 In a further aspect of the invention there is provided a method of producing a structural gene coding for 
authentic human serum albumin. Said method comprises the following steps, 

a) designing the nucleotide sequence coding for authentic human serum albumin by selecting codons 
with regard to a non-human host chosen for expression of authentic human serum albumin, whereby the 
selection of the codons is effected so that 

10 in the first instance, codons most frequently used by the chosen non-human host are selected, and 

in the second instance, codons used by the chosen non-human host in the second or third place are 
selected, 

to avoid the appearance of such restriction sites which are used during the assembly of the gene, 
to create one unique cleavage site between a 5'-f ragment and the rest of the whole gene, and 
15 to eliminate 8-base-pairs long or longer palindromes within oligonucleotide subunits of fragments to be 

cloned, 

b) dividing the designed nucleotide sequence into a 5'-fragment to be chemically synthesized and a few 
fragments to be cloned so that joining points between said few fragments will be at suitably located G-C 
dinucieotide sequences, 

20 c) modifying said designed few fragments of b) by supplementing the designed nucleotide sequences 

thereof with an extra nucleotide sequence GGTAC at the 5'-terminus, except for the fragment to be joined 
to the 5'-fragment of b), and further dividing said few fragments into subunits having a 3'-nucIeotide G, 
which subunits in turn are individually supplemented with an extra nucleotide sequence GGCC; 

d) individually chemically synthesizing the modified supplemented subunits of c) in single-stranded 
25 form in per se known manner, and chemically synthesizing the 5' fragment of b) in double-stranded form 

in per se known manner; 

e) consecutively cloning the synthesized subunits of d) starting from the 5' -terminus of the modified 
supplemented few fragments of c) into a few individual recombinant vectors in per se known manner, with 
the aid of adapters and enzymaticai filling-in reaction, to form cloned double-stranded fragments of the 

30 gene, which correspond to the modified supplemented few fragments of c) , 

f) assembling the cloned double-stranded fragments of e) by cleaving the few recombinant vectors of 
e), in pairs, with the enzyme Kpnl and the enzyme Apal, respectively, - one at the created 5'-terminal Kpnl 
restriction site, and the other at the created 3' terminal Apal restriction site, - to form sticky ends which 
are made blunt ends by a single-strand-specific enzyme in per se known manner - leaving an 

35 end-nucleotide C and an end-nucleotide G, respectively - followed by cleavage with another restriction 

enzyme having a cleavage site which is unique in both of the recombinant vectors of the pair in question, 
to form on the one hand a linear vector containing a cloned fragment of the gene and, on the other hand, a 
cleaved-off fragment of the gene, which two last-mentioned fragments are, in per se known manner, 
enzymatically joined at the blunt ends - a dinucieotide G-C which is included in the nucleotide sequence 

40 of the gene, being formed at the joining point - 

to obtain a recombinant vector which finally includes all the few designed fragments of b) in 
double-stranded form, and 

g) supplementing the recombinant vector obtained in f) with the chemically synthesized 5' fragment of 
d) to form the whole structural gene coding for authentic human serum albumin. 

45 The designed structural gene, having 1761 nucleotides coding for authentic mature human serum albumin 
having 585 amino acid residues, was in a preferred embodyment divided into five large fragments. The first 
fragment was synthesized double stranded in per se known manner, and the second to fifth fragments were 
produced according to the technique disclosed in EP-A-0 182 383. whereby a single strand is chemically 
synthesized and the complementary strand is enzymatically synthesized. 

50 The expression "a unique cleavage site" means that the cleavage site is characteristic of a specific enzyme 
and that it does not occur anywhere else in the fragments to be joined. 

The technique of joining two fragments having selected end-nucleotides was also used later in the 
intermediate plasmid constructions leading to the yeast expression vector. 
The details of the method of the invention are described in connection with the preferred embodiments of 

55 the invention. 

Method of producing authentic HSA 

In yet another aspect of the invention there is provided a method of producing authentic human serum 
60 albumin by propagating a host transformed with a vector comprising a recombinant DNA sequence under 
expression and optionally secretion conditions and isolating the expressed and optionally secreted protein 
product. The characteristic features of this method are a) that a host transformed with a vector comprising a 
structural gene according to the invention is utilized, and b) that authentic human serum albumin or optionally 
the methionyl derivative thereof is isolated. 
65 In a preferred embodiment of this aspect of the invention the host used is Saccharomyces cerevisiae 



4 



EP 0 308 381 A1 



transformed with a shuttle vector (E. coli- yeast) comprising a structural gene coding for authentic human 
serum albumin, said gene being composed of a nucleotide sequence wherein the codons have been selected 
with regard to a yeast host. 

Authentic HSA 5 

In still another aspect of the invention there is provided authentic human serum albumin resulting from the 
method of producing the same according to the invention. This authentic HSA can be used for alt applications 
instead of native mature HSA. 

10 

Pharmaceutical composition 

In an additional aspect of the Invention there is provided a pharmaceutical composition comprising 
authentic human serum albumin according to the invention in admixture with a pharmaceutical^ acceptable 
carrier and/or diluent. Suitable carriers and/or diluents are those used for native HSA, such as saline solution, is 
and reference is made to e.g. US Pharmacopoeia for guidance. The same also applies to conventional 
additives, such as preservatives, pH regulators, buffers etc which may optionally be included. 

Description of preferred embodiments and experimental details 

20 

Short description of the drawings 

The drawings relate to plasmid constructions and to a fluorograph. Specifically, 

Fig. 1 shows the physical map of the coliplasmid pGB1. 25 
Fig. 2 shows the map of plasmid pGB2 containing a yeast HIS3 gene. 

Fig. 3 shows the map of the plasmid pGB3-229T (a.) and the construction by steps 1 and 2 of the basic 
expression vector pPT2HKi (c.) through an intermediate construction pGB3-229TK° (b.). 
Fig. 4 shows the map of pGB2 (HIS3, PH05, PH03). 

Fig. 5 shows the map of plasmid pUC18/623P (a.) containing the promoter of the yeast PHQ5 gene and 30 
the modifications (1., 2. and 3.) leading to the construction of plasmids pUC18V623P (b.) and 
pUC18V622PH (c). 

Fig. 6 shows the physical map of the basic expression vector plasmid pPT2HKi. 

Fig. 7 shows the construction of the yeast-E.coli shuttle vector plasmid pBY200. 

Fig. 8 shows the construction of two expression vector plasmids pYHSA 221 and pBY2/HSA containing 35 
the entire °HSA-express(on cartridge 0 from pPT2/HSA. 

Fig. 9 shows the flow diagram of the construction of yeast vectors to express a synthetic HSA gene. 

Fig. 10 shows the fluorograph of the 3S S-methionine-labeled proteins immunoprecipitated with horse 
anti-HSA serum and resolved in SDS-poIyacrylamide gel. 

Fig. 1 1 shows the construction of a yeast expression plasmid containing an artificial prepro-leader 40 
coding sequence and an artificial gene coding for HSA (No 1). 

Fig 12 shows the products of CNBr-cleavage of purified natural HSA (A and C) and of yeast-produced 
HSA (B and O) resolved by SDS-polyacrytamide gelelectrophoresis. The Commassie-stained gel was also 
subjected to laser-scanning (using LKB-Ultro-Scan). 

Fig. 13 shows a Western-blot of HSA expressed and secreted by the yeast "YEprepro*-HSA° (tracks B 45 
and C) compared to proteins expressed by YHSA-221 (tracks D and E). Track A shows a purified HSA 
sample. 

Scheme 1 - Map of the artificial HSA gene 
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60 

Roman numerals: large HSA fragments: HSA I, II, 111, IV, V. 

Arabic numerals: synthetic oligodeoxyribonucleotides HSA 1 t 2, 3 .... 24, each containing an extra GGCC 
sequence at their 3'-terminus. This extra sequence does not show up in the HSA sequence. HSA 7, 13 and 19 
oligonucleotides also contain an extra GGTAC sequence at their 5'-terminus ( which does not show up in the 
final HSA sequence. 
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When an oligonucleotide (HSA 1) is ligated with the adapter molecule, it will be called, e.g. HSA 1 + A (see 
Scheme 2). 

When HSA 1 + A is cloned into the commonly used E. coli vector pUC19 [Yannisch-Perron C., Vieira, J. and 
Messing, J. Gene 33, 103-119, (1985)] the plasmid obtained is called pHSA 1. 
5 When HSA 2+ A is cloned into the above obtained pHSA 1, the resulting plasmid is called pHSA (1-2). 
Subsequent clonings will result in pHSA (1-6), which plasmid contains the HSA II large fragment cloned in 
pUC19, and it can be called pHSA II. 

Similarly. HSA III, HSA IV and HSA V large fragments are obtained from oligonucleotides 7-12, 13-18 and 
19-24, respectively, resulting in pHSA 111, pHSA IV and pHSA V plasmids. 
10 When HSA II and HSA III large fragments are joined in pUC19 vector, the resulting plasmid can be called 
pHSA (1-12) or rather pHSA (li-lil). 

Similarly, when HSA IV and HSA V large fragments are joined in pUC19 vector, the resulting plasmid can be 
called pHSA (13-24) or rather pHSA (IV-V). 
When HSA (IMII) and HSA (IV-V) are joined, they will result in pHSA (ll-V) in which nearly the whole coding 
15 region of HSA (from 13 - 585 amino acids of mature - whithout N-terminal Met) is cloned. 

When pHSA (ll-V) is supplemented with HSA I fragment in pUC19. the resulting plasmid will be named pHSA. 
HSA I fragment was synthesized as a partial duplex in two forms. (Scheme 4). 
Accordingly, two versions of pHSA, namely pHSA No 1 and pHSA No 2 are obtained (Scheme 5). 
From pHSA No 2, Met-HSA coding gene can be obtained (as a fragment with blunt-end and with Sacl end, 
20 Scheme 6). 

From pHSA No 1 , mature HSA coding gene can be obtained (as a fragment with blunt-end and with Sacl 
end, Scheme 7). 

Either Met-HSA or mature HSA coding DNA region can be cloned into pPT2HKi E. coli vector containing the 
PH05 yeast promoter + signal sequence coding region and the His3 yeast transcription terminator. (TO obtain 
25 pPT2/HSA). The promoter-signal sequence - HSA gene - terminator cassette will be incorporated into a 
self-replicating yeast vector pBY200 for HSA expression. 



Scheme 2 -'Example of the ligation of an oligonucleotide 
with adapter molecule 
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HSA 1 



GGGCC 
PC 

CCCGGG 



adapter 



G 

CTTAA 



45 



50 



-GGGCCC 
CCCGGG 



G 

CTTAA 



HSA 1+A 



The HSA 1 oligonucleotide and the upper strand of the adapter are 5'-phosphorylated, while the adapter 
lower strand is not. 
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Scheme 3 - Adapters used during HSA cloning 



Apal .EcoRI 

CGGACGGCGACGGCGACGGCGACCG 
CCCGGGCCTGCCGCTGCCGCTGCCGCTGGCTTAA Adapter 1 



Apal EcoRI 

CGAGTATGCGACAGCTGG 
CCCGGGCTCATACGCTGTCGACCTTAA Adapter 2 



Apal SacI EcoRI 

I i 
CTGGAGCTCAGTCTG 

CCGGGACCTCGAGTCAGACTTAA Adapter 3 



Scheme 4 - HSA I fragments 



p stl Sau3AI 

Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys 
GACGCTCACAAGTCTGAAGTCGCTCACAGATTCAAG 

No 1 ACGTCTGCGAGTGTTCAGACTTCAGCG AGTGTCTAAGTTCCTAG 



PstI Sau3Al 

Bell M at Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys 
GfTGATCffTGGACGCTCACAAGTCTGAAGTCCCTCACAGATTCAAG 

No 2 ACG TC AC TAGTACCTGCG AG TGTTCAGACTTCAGCG AGTGTCTAAGTTCCTAG 



10 



15 



20 



25 



Adapter 1 was used to facilitate cloning of most of the HSA oligonucleotides 

Adapter 2 was used for HSA 16, 17 and 18 oligonucleotides 30 
Adapter 3 was used. to replace Adapter 1 downstream of the HSA gene, in order to introduce a SacI site 

necessary to clone the HSA gene into the E. coli vector pPT2HKi containing the yeast promoter and 

terminator regions. 
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Scheme 5 - Clort-ing the complete HSA gene versions into pUC19 



Pstl EcoRI 

H »- 



pUC19 



Pstl-EcoRI 



Pstl EcoRI 



Hind III Sau3Al 

— t t — h — I — I- 



Apal Sac I EcoRI 



Pstl Sau3AI 

I 1 

HSAI 



II III IV 
pHSA (II-V) 



1, Hind III-EcoRI, 

isolate small fragment 
2, Sau3Al 



Sau3AI 

i-i — h- 

II III IV 
HSA (II-V) 



Apal Sad EcoRI 

— i 1 \ 



la DNA ligase 



Pstl Sau?AI 
4/ * 



I i r 
II in 



pHSA 



Apal SacI EcoRI 

-f 1 — h- 



IV 



pHSA No 1: HSA I = HSAI No 1 
pHSA No 2: HSA I = HSAI No 2 
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Scheme 6 - Obtaining Met-HSA coding DNA piece from pHSA No 2 



Pstl 



Bell 



Sad 



T G C A G'T G A T C fCT GGACGCT — 
-GACGTCACTAGTACCTGCG A — 



— GAGCTC- 
--CTCGAG- 



Bc.l I 



S a c I 



GATCATGG ACGC T- 
TACC TGCGA- 



blun t - end 



G A G C TC- 
— C TCG AG 



1, racing bean nuclease 
2 , Sac I 



Met Asp Ala 
ATGGACGCT- 

TACCTGCGA- 



GAGCT 

C 5 ac I 



Note: To obtain the Met-HSA coding region an unique res trie 
tion site was introduced into HSAI {then into pHSA), 
namely Bell recognition sequence into HSAI (then 
into pHSA resulting in pHSA No 2 version). 
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Scheme 7 - Obtaining mature HSA coding DNA from pHSA No I 



Pst I 
-CTGCAGACGCT- 

-GACGTCTGCGA- 



Sac I 
-G AGCTC- 



CTCGAG 



Ps tl 



GACGCT- 
ACGTCTGCGA- 



Sad 
— G ACCTC- 
— C TCGAG" 



1, Klenow polymerase + dN TP 

2, SacI 



Asp Ala 
GACGCT- 

blunt-end CTGCG A' 



-GAGCT 

— C SacI end 
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Synthetic primers used to sequence parts of the HSA gene 

When HSA oligonucleotides were cloned into pUC19 vector, (or another HSA nucleotide cloned into a 
5 previously obtained pHSA vector), a sequencing primer GTAACGCCAG GGTTTTCCCAGT synthesized 
previously and named as pKO primer I (A. Simoncsits, M. Kalman, I. Cserpan and C. Kari, Nucleic Acids Res. 
Symp. Ser. No 14, 1984, 321-322) was used to check the sequence of the clones obtained. This primer is 
located between nucleotide positions 348-369 of the published pUC19 sequence [Yanisch-Perron. C, Vieira, J. 
and Messing, J. Gene, 33 (1985) 103-119], and used to sequence all of the individually cloned HSA 
10 oligonucleotides (all 24). 

When HSA II and HSA HI large fragments were joined, the joining point was checked by a synthetic primer 
GCAGCCTTGTCGGCAGCTTG, which is complementary to the HSA gene sequence between nucleotide 
positions 508-527 (for mature HSA). 
HSA IV and HSA V junction was checked by using CGTGCAAAACACATAATTGG primer which is 
15 complementary to HSA gene sequence between positions 1374-1393. Junction of HSA III and IV large 
fragments in pHSA (ll-V) was checked by using HSA 10 oligonucleotide itself as a sequencing primer. 

When the HSA gene synthesis is completed in pUC19 vector the whole HSA coding region was sequenced 
using plasmid template and 10 different sequencing primers. Further confirmation of the HSA coding 
sequence was obtained when it was replaced from the pUC19 vector into M 13mp1 9 vector (Yanisch-Perron. C. 
20 etc) and sequencing was performed on single-stranded phage DNA template using the same 10 primers. 

Synthetic primers to check the whole HSA coding region 
25 either in -pUC19 or in Ml3mpl9 

primer name nucleotide position in the 

30 HSA coding region 

pKO primer I outside of HSA, in the lacZ 

part of pUC19 

35 



40 



45 



50 



55 



60 



pHSA primer 


1 


1587-1603 


pHSA primer 


2 


1398-1414 


pHSA primer 


3 


1195-1211 


pHSA primer 


4 


988-1007 


pHSA primer 


5 


795- 809 


pHSA primer 


6 


582- 597 


pHSA primer 


7 


382- 398 


pHSA primer 


8 


178- 192 


pHSA primer 


9 


66- 85 



The last primer (primer 9) was also used to check 
the junction of the HSA (mature or Met-form) and the 
yeast PH05 promoter-signal sequence when the HSA gene 
was replaced from pUC19 into pPT2HK^ 



65 
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MATERIALS AND METHODS 



Enzymes 

Apal 

EcoRI 

Klenow polymerase 

T4 DNA ligase 

Kpnl 

Sad 

BamHI 

Xbal 

mung bean nuclease 

Hind III 

Sau3AI 

Ball 

PstI 

Xhol 

T4 polynucleotide kinase 

Tl RNase 

Proteinase K 

Bell 

Sail 

Helicase 
Glucuronidase 



Source 
Boehringer 

New England Biolabs (NEB) 

Boehringer 

NEB 

NEB 

NEB 

Boehringer 
NEB 

Pharmacia 

NEB 

NEB 

NEB 

NEB 

Boehringer 

Boehringer 

Calbiochem 

Merck 

NEB 

NEB 

REACT IFS IBF 
Boehringer 



10 



15 



20 



25 



30 



35 



40 



Isotopes 

y- 32 P-ATP (<5000 Ci/mmol) 
a- 32 P-dATP (800 Ci/mmol) 

<x- 35 S-dATP (- 1200 (Ci/mmol) were from Amersham 
35 S-methionine (~ 800 Ci/mmol) were from Amersham 

Chemical synthesis of oligodeoxyribonucJeotides 



45 



50 



Either the phosphate-triester method was used with the help of a manual DNA bench synthesizer (Omnifit), 
using monomer or/and dimer building blocks (Sproat B.S. et al 1983, Tetrahedron Letters 24, 5771), or the 
phosphoramidite method using an automatic Gene Assembler (Pharmacia) according to the manufacturers 
Manual. Chemicals were obtained either from Cruachem (phosphate-triester chemistry) or from Pharmacia 
(phosphoramidite chemistry). 

S'-phosphorylation of the synthetic oligodeoxyribonucleotides 

Enzymatic phosphorylation was performed by using T4 polynucleotide kinase and ATP. Depending on the 
specific requirements, this reaction was performed with either radioactive or non-radioactive ATP. 

a) Phosphorylation with y- 32 P-ATP of high specific activity 

This procedure was used for HSA oligonucleotides to obtain hybridization probes or for 5'-labeling of the 
sequencing primers when the sequencing reactions were carried out on plasmid DNA template. 10 pmol 
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oligonucleotide was dissolved in y- 32 P-ATP (7 uJ, ~ 5000 Ci/mmol, 10 mCi/ml) 250 mM Tris-HCI pH 7.5-50 mM 
MgCl2 (2 and 100 mM DTT {1 |xl). 0.5 u.l T4 polynucleotide kinase (10 U/uJ) was added and the mixture was 
kept at 37°C for 30 min followed by heat treatment (100° C, 3 min) to inactivate the enzyme. The solution was 
diluted according to the further use with either hybridization buffer or with sterile water. The excess of 
5 non-reacted y- 32 P-ATP was not removed. 

b) Phosphorylation with y- 32 P-ATP of low specific activity 

This procedure was employed to label the HSA oligonucleotides before their ligation with adapters. 
10 50 pmol oligonucleotide and 100 pmol y- 32 P-ATP (200 Ci/mmol) were dissolved in 10 u.l reaction volume 
containing 50 mM Tris-HCI pH 7.5, 10 mM MgCI 2 and 10 mM DTT, and 1 ul T4 polynucleotide kinase (10 U/jaI) 
was added. After standing at 37° C for 1 hr, the mixture was heat treated at 100°C for 3 min. 

c) Phosphorylation with non-radioactive ATP 

15 

This procedure was applied to: upper strand of adapter 1 and adapter 2, as well as for both strands of other 
adapter-like molecules like adapter 3 and HSA I fragment oligonucleotides. 

Phosphorylation of the upper strand of adapter 1 and adapter 2 oligonucleotides was performed on large 
scale as follows. 

20 2.2 nmol of oligonucleotide and 20 nmol ATP were dissolved in 100 ul reaction volume containing 50 mM 
Tris-HCI, pH 7.5, 10 mM MgCl2. 10 mM DTT and 10 units of T4 polynucleotide kinase. Reaction was performed 
at 37°C for 1 hr followed by heat treatment at 100° C for 3 min. 

Other non-radioactive phosphorylations were carried out essentially as it was described for the low specific 
activity phosphorylation on 50 pmol oligonucleotide scale but no radioactively labeled y- 32 P-ATP was added to 

25 the reaction mixture. 

Ligation of HSA oligonucleotides with adapter (general procedure) 

25 pmol of 5'- 32 P-phosphorylated HSA oligonucleotide was mixed with 75 pmol of S'-phosphorylated upper 

30 strand adapter oligonucleotide and with 75 pmol of non-phosphorylated lower strand adapter oligonucleotide 
in 50 uJ reaction volume containing 50 mM Tris-HCI. pH 7.5. 10 mM MgCl2. 10 mM DTT, 1 mM ATP. The mixture 
cooled to 15° C and approx. 0.2 uJ (approx. 80 units) of T4 DNA ligase was added. Reaction was performed at 
15°C for 4-16 hrs. 50 uJ of 1 M NaCI and 1 uJ of yeast carrier tRNA (10 u.g/u,l) was added and the 
oligonucleotides were precipitated with 300 u.l ethanot in liquid nitrogen bath for 2 min. The mixture was 

35 centrifuged at 12 000 rpm for 5 min, the pellet was dried and dissolved in 10 ul of gel loading buffer containing 
80% formamide. 10 mM EDTA, 0.05% xylene cyanole and 0.05% bromophenol blue. Separation of the ligated 
HSA oligonucleotide from the non-ligated one (and from the adapter) was achieved by applying the above 
solution onto a 10% acrylamide gel containing no urea. Gel electrophoresis was carried out at 400V for 3-5 hrs 
using 100 mM TBE as gel and running buffer (100 mM Tris, 100 mM boric acid, 2 mM EDTA, pH 8.3). After 

40 radioautography of the gel (2-10 min) 2 major radioactive bands were located, of which the lower band 
corresponded to the non-ligated HSA oligonucleotide while the upper band corresponded to the 
adapter-Iigated HSA oligonucleotide. The gel piece corresponding to the latter was cut out and soaked in 50 
mM NaCI (300 uJ) at 37°C for 10-16 hrs. The supernatant was treated twice with phenol (saturated with 50 mM 
Tris-HCI, pH 8.0, 300 ul) and the oligonucleotide-adapter adduct was precipitated after addition of 30 uJ 3 M 

45 NaOAc, pH 5.2. 1 \i\ yeast carrier tRNA (10 u,g/ui) and 750 ul ethanol. 

The pellet was washed with ethanol, dried and dissolved in 1 0 u.l of sterile water, and an aliquot is counted (in 
a Packard liquid scintillation counter) to estimate the yield of the ligation reaction. The yield, based on the 
starting material 32 P-phosphate HSA oligonucleotide, varied between 20-50% (isolated yield). 
21 of the 24 HSA oligonucleotides were ligated with adapter 1. The exceptions are HSA 16, 17 and 18 

50 oligonucleotides, which were ligated with adapter 2. (For HSA 16, this new adapter was obviously necessary, 
but perhaps it was not a better choice for HSA 17 and 1 8. Anyway, these three oligonucleotides were ligated at 
the same time with adapter 2). 

Bacterial strains 

55 

Most of the HSA containing piasmid transformations and propagations were performed using JM101 E. coli 
(Messing, J. Crea. R. and Seeburg, P.H., Nucleic Acids Res. 9. (1981), 304-321). This strain has the following 
genotype: supE, thi, A(lac-proAB), [F. traD36, proAB, lacl"ZAM15]. 
A dam- E. coli strain (GM2) (Morinus, M.G. and Morris, N.R. (1973) J. Bact. 114, 1 143-1 150) was used for 
60 piasmid propagation before Bell enzyme manipulation was required. 

During pBY2/HSA No 1 and pBY2/HSA No 2 constructions an E. coli (K12) strain JF1754 strain (hsd R 
hsdM* lac gal met leu B his B) was used as host, references: Storms, R.K.. McNeil. J.B., Khanendekar, P.S.. 
An, G.. Parker, J. and Friesen, J.D. (1979), J. Bacteriol. 140, 73-82; Kiss. G.B., Amin. A.A. and Perlman, R.E. 
(1981) Molecular and Cellular Biology, 535-543. 
65 The leu B and his B mutations of JF1754 can be complemented with the corresponding yeast genes (leu 2 
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and his 3, respectively), reference: Struhl. K. and Davis, R.W. (1980) J. Mol. Biol. 136, 309-332. 
Yeast strain 

AH220 [a, trp 1 , leu 2-3, 2-112, his 3-1 1 , 3-1 5, pho 5. pho 3] laboratory haploid strain was obtained from A. 5 
Hinnen, CIBA-CEIGY AG, Biotechnology Department, Basel, Switzerland. 

E. coli transformation with plasmid and phage vectors 

This was performed essentially as described by Hanahan, D. (in DNA Cloning, Vol I. Edited by Glover, D.M., 10 
IRC Press Limited 1985, pp 109-135) using frozen competent cells prepared according to Protocol 3 of this 
article. 



Yeast transformation 

Yeast spheroplasts prepared by helicase treatment of AH220 were transformed according to Hinnen et al 
(Hinnen, A, Hicks, J.B. and Fink, G.R. (1978) Proc. Natl. Acad. Sci: USA, 75, 1929-1933). 

Plasmid preparation 



Restriction enzyme cleavage of plasmid DNA 



15 



20 



We used the slightly modified version of the rapid alkaline extraction procedure (Birnboim, H.C. and Doly, J. 
(1979) Nucleic Acids Res., 7, 1513-1523) Minipreps: 

Single colony was inoculaled into 3 ml of LB-medium (Maniatis, T. Fritsch., E.F. and Sambrook, J. (1982) 
Molecular Cloning, Cold Spring Harbor Laboratory, N.Y. p 440) containing 100 u.g/ml ampicillin and the culture 
was shaken at 37° C for 10-18 hrs. Cells were harvested by centrifugation and resuspended in 100 uJ of solution 25 
I (50 mM glucose, 25 mM Tris-HCI pH 8.0, 10 mM EDTA) and left at room temperature for 5 min. 200 uJ of freshly 
prepared solution II (0.2 N NaOH, 1<>/o SDS) was added and the solution was briefly vortexed, then put on ice 
for 5 min. Ice-cold solution III (150 u,l, 3M potassium acetate-2M acetic acid) was added and the mixture was 
briefly vortexed, then put on ice for 15 min. The mixture was centrifuged at 12000 rpm for 5 min and 400 pj of 
the supernatant was pipetted into a fresh tube. 800 uJ of ethanol was added and the mixture was left to stand 30 
for 5 min, then spun at 12000 rpm for 2 min. The pellet was redissolved in 400 uJ of 100 mM Tris-HCI, pH 8.0-50 
mM NaOAc, pH 6.5, and 1 mi of 95% ethanoi was added. After standing at -20° C for 30 min, the mixture was 
spun at 12000 rpm for 2 min. The pellet was dried and dissolved in 100 uJ of 10 mM Tris-HCI, pH 8.0-1 mM EDTA 
solution containing 0.5 U of TiRNase, and the solution was kept at 37° C for 30 min, then extracted with 100 ul 
of phenol saturated with 50 mM Tris-HCI, pH 8.0. The aqueous phase was taken (approx 90 uJ), 10 uJ of 3M 35 
sodium acetate pH 5.2 was added followed by 260 uJ of 95% ethanol and quick cooling of the mixture in liquid 
nitrogen bath. After centrifugation (12 000 rpm, 3 min) the pellet was redissolved in 200 uJ of 0.3M NaOAc, pH 
5.2 and 500 (jJ of 95% EtOH was added to precipitate the nucleic acid as above (quick chilling En liquid nitrogen 
bath followed by centrifugation). The pellet was washed with 1 ml of 95% EtOH, dried and dissolved in 30 uJ of 
sterile water. 40 

The yield of the plasmid DNA was estimated as 3-5 jig. For agarose gel electrophoresis and restriction 
analysis, 1-2 jil of the above solution was used, while 3 uJ was used for sequencing reactions. When the above 
obtained plasmid was used for further cloning experiment 20 ]xl solution was taken for the linearisation with 
one or usually with two enzymes followed by linear vector isolation. 



45 



All the analytical restriction analyses were performed according to the manufacturers recommendations 
except for that BSA was always omitted from the reaction buffers. 

When a particular plasmid is cleaved on preparative scale with one or more enzymes simultaneously or 50 
sequentially reaction conditions are always given. 

pUC19 cleavage with two different restriction enzymes 

Generally, the first HSA oligonucleotides of the large HSA fragments (II, III, IV and V) are cloned into pUC19 55 
vector cleaved with two different enzymes. According to this original plan, only HSA 1, HSA 7, HSA 13 and HSA 
19 oligonucleotides, ligated previously with an adapter (adapter 1) are cloned into pUC19. During the gene 
assembly work, however, it turned out to be more advantageous (or quicker) to clone two more HSA 
oligonucleotides, namely HSA 4 and HSA 17, into pUC19 rater than into the corresponding intermediate pHSA 
vectors. eo 



a) pUC19 cleavage with Pstl and EcoRI 

2 ng of pUC19 was treated in 100 uJ of high salt buffer (100 mM NaCI, 50 mM Tris-HCI. pH 7.5, 10 mM 
MgCl2, 1 mM DTT) with 20 units of Pstl and 20 units of EcoRI at 37° C for 4 hrs. DNA was ethanol precipitated by 65 
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adding 5 ul of 3M sodium acetate, pH 5.2 and 300 ul of ethanol, chilling the mixture for 2 min in liquid nitrogen 
bath followed by centrifugation at 12000 rpm for 3 min. The pellet was dried and dissolved in 60 u.l of Wo Ficoll 
400, 0.050/o bromophenol blue and the linear vector was isolated after separation by electrophoresis on a O.50/0 
agarose gel in 40 mM Tris-acetate, 2 mM EDTA buffer (TAE buffer) followed by electroeiution, phenol 
5 extraction and ethanoi precipitation promoted by adding 10 ug of yeast carrier tRNA [Maniatis, T., Fritsch, E.F. 
and Sambrook, J. Molecular Cloning, Cold Spring Harbor Laboratory (1982) pp. 164-166]. The pellet obtained 
was dissolved in 10 ul of sterile water and the concentration of the linear vector was estimated by minigel 
method (ibid., pp. 468-469). 
This vector was used for cloning: HSA 1 

10 

b) pUC19 cleavage with BamHI and EcoRl 

2 jig of pUC19 was treated in 100 ul of high salt buffer with 20 units of BamHI and 20 units of EcoRl as above, 
and the isolation of the linear vector was performed essentially in the same way as described above. This 
15 vector was used for cloning: HSA 13, HSA 17. 

c) pUC19 cleavage with Xbal and EcoRl 

This was done using 20 units of Xbal and 20 units of EcoRl for 2 ug of pUC19 essentially as described above. 
20 Xbai-EcoRI pUC19 vector was used for cloning: HSA 4. HSA 7, HSA 19. 

Cleavage of the intermediate pHSA vectors with Apal and EcoRl 

20 ul of pHSA plasmid prepared as described before was made up to 100 ul reaction volume containing 6 
25 mM Tris-HCI, pH 7.4, 6 mM NaCI, 6 mM MgCl2, 1 mM DTT and 40 units of Apal enzyme and the reaction mixture 
was kept at 37° C. A 2 ul sample was run on a 0.5<>/o agarose gel in TBE buffer (89 mM Tris, 89 mM boric acid, 8 
mM EDTA). When the cleavage seemed to be complete (after 1-4 hrs). 10 ul of 1M NaCI, 5 ul of 1 M Tris-HCI, pH 
7.5 and 20 units of EcoRl were added and the mixture was kept for further 4-16 hrs at 37° C. The linear vector 
DNA was precipitated with ethanol and purified on a O.50/0 agarose gel as described before for linear pUC19 
30 vector isolation. 

This Apal-EcoRl double digestion was performed with the following plasmids: pHSA 1,2. 4, 5,7,8. 9, 10. 11, 
13, 14, 15, 17, 19, 20, 21. 22, 23. 

Cloning of HSA oligonucleotide-adapter complexes into pUC19 or into pHSA vectors 

35 

General procedure: 

Approx. 0.1 jig of double-cleaved pUC19 or pHSA vector was mixed with 5 pmol of HSA 
40 oligonucleotide-adapter complex in 10 ul reaction volume containing 50 mM Tris-HCI, pH 7.5, 10 mM MgCte, 10 
mM DTT, 1 mM ATP and 80 units of T4 DNA ligase and the reaction mixture was kept at 15°C for 4-16 hrs. The 
mixture was heated to 60° C for 5 min and, after cooling to room temperature, 1 ul of 1 mM dNTP (containing all 
four deoxynucleoside 5'-triphosphates at 1 mM concentration) and 1 ul of 0.5 units/ul Klenow polymerase was 
added and the reaction mixture was left at room temperature for 15 min. It was then heated at 60° C for 10 min 
45 and 4 ul of sterile water, 2 ul of a buffer containing 250 mM Tris-HCI, pH 7.5 and 50 mM MgCl2, 1 ul of 100 mM 
DTT, 1 uJ of 10 mM ATP and 200 units of T4 DNA iigase were added at 15°C. The reaction mixture was kept at 
15°C for 6-20 hrs and then transformed into frozen competent JM101 E. coii cells as referred earlier. The 
colonies obtained on LB plates containing 100 ug/ml ampicillin were picked onto LB-ampicillin master plate 
and onto nitrocellulose replica plate [Grunstein, M. and Hogness, D. (1975) Proc. Natl. Acad. Sci. USA 72, 
50 3961]. The colonies grown up on the nitrocellulose replica plate were iysed and hybridized with the 
corresponding 5'- 32 P-phosphate labeled HSA oligonucleotide probe [Maniatis, T. t Fritsch, E.F. and Sambrook, 
J. Molecular Cloning (1982), Cold Spring Harbor Laboratory, pp. 314-325]. Usually 4-10 positive colonies were 
grown up in 3 ml of LB-ampicillin medium and plasmid DNA was prepared as described previously. 

55 Dideoxy sequencing on plasmid template 

The supercoil sequencing method [Chen, E.Y. and Seeburg. P.H. DNA 4, 165. (1985)] was performed with a 
few modifications. 3 ul of plasmid DNA prepared as before was mixed with 17 ul of 0.3M NaOH-0.3 mM EDTA 
at room temperature. After 5 min 3 ul of 2M ammonium acetate-acetic acid, pH 4.5 and 60 ul of ethanol were 

60 added and the mixture was kept at -80° C for 15 min. The mixture was centrifuged { 1 2 000 rpm, 5 min) and the 
pellet was washed with 70% EtOH, dried and dissolved in 10 ul of buffer containing 7 mM Tris-HCI, pH 7.5, 7 
mM MgCl2, 5 mM p-mercaptoethanol, 0.1 mM EDTA and 0.25 pmol of 5'- 32 p-phosphate labeled sequencing 
primer (sequencing primers used during the work are shown in Scheme 10 and above. Sometimes one of the 
HSA oligonucleotides was also used as sequencing primer despite the fact that they contain a 3'-terminal 

65 GGCC extra sequence). The mixture was heated at 45°C for 15 min. Then four 2 ul aliquots were pipetted into 
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wells of a microtiter plate. 2 pJ of each four dideoxy termination mixtures [Hong, G.F. Bioscience Reports, 2, 
907 (1982)] and 2 uJ of 0.25 units/jil Kienow polymerase were mixed with each of the four aiiquoted 
primer-template and the mixtures were kept at room temperature for 20 min, then at 50° C for 10 min. To each 
reaction mixture 3 jxl of gel loading buffer containing 80% formamide, 10 mM EDTA, 0.05o/o bromophenol blue 
and O.OSO/o xylene cyanol was added and the mixtures were heated at 100°C for 2 min. Gel electrophoresis was 
carried out on a 6<>/o acrylamide gel containing 8M urea, 90 mM Tris, 90 mM boric acid, 2 mM EDTA, pH 8.3. 

Cloning the individual HSA oligonucleotides (HSA 1, 2, ....24) 

The original plan was the following: 

The whole HSA coding region was divided into five fragments, HSA I, II, III, IV and V. The latter four fragments 
(II, 111, IV and V) were further divided into 6-6 single-stranded oligonucleotides (each ending at the 3' -terminus 
with G and supplied with an extra GGCC sequence by chemical synthesis), altogether 24 oligonucleotides. The 
HSA large fragments (II, III, IV and V) were to be obtained by consecutive clonings of the synthetic, 
single-stranded oligonucleotides (with the help of an adapter) into pUC19 or pUC19 derived pHSA vectors, 
exemplified here with pHSA II: 
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Similarly, pHSA III was obtained from HSA 7, 8, 9, 10, 11, 12 oligonucleotides. pHSA IV was obtained from 
HSA 13, 14, 15, 16, 17, 18 oligonucleotides. pHSA V was obtained from HSA 19, 20, 21, 22, 23, 24 
oligonucleotides. 

This general strategy was usually employed for cloning HSA oligonucleotides with the help of adapter 1 (see 
Scheme 3), but we deviated from this in a few cases. The reasons to do so were either to speed up the 
assembly work by parallel cloning of more than one oligonucleotide within a large fragment (like in case of HSA 
II) or to solve cloning problems we encountered during the work. 
These exemptions are: 

HSA 1 oligonucleotide could be cloned as a whole only with the help of a partial duplex (at the 5'-terminus of 
HSA 1) 

HSA It large fragment was obtained as pHSA II from the previously cloned HSA (1-3) and HSA (4-6) DNA 
segments 

HSA 15 oligonucleotide could only be cloned to obtain the correct sequence with the help of a 

complementary oligonucleotide, which covered nearly 2/3 part of the original HSA 15 

HSA 16 oligonucleotide could only be cloned with the help of a new adapter (adapter 2) 

HSA 17 oligonucleotide could not be cloned into pHSA (13-16) so that the expected pHSA (13-17) be 

obtained. Although HSA 17 sequence was found in the obtained piasmids, deletions in the previously cloned 

regions were observed. So HSA 17 was cloned into pUC19. 

HSA 18 oligonucleotide was cloned into pHSA 17 with the help of adapter 2 

HSA IV large fragment was obtained from the previously cloned HSA (13~16) and HSA (17-18) DNA 
segments. 
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Cloning HSA 1 into pUC19 

When HSA 1 oligonucleotide ligated with adapter 1 (HSA 1+At) was tried to be cloned into BamHI-EcoRI 
cleaved pUC19 by the cloning procedure described in details before, the complete HSA 1 region was not 5 
obtained in cloned form. About 50 clones hybridizing with 5'- 32 P-labeled HSA 1 oligonucleotide were 
sequenced, and it was found that mast of the clones lacked the 5' -terminal T residue of HSA 1 (the rest of them 
lacked more than one residue). 

A new strategy was used then to get the whole HSA 1 cloned as follows. Pstl-EcoRI cleaved pUC19 was 
used as a cloning vector and a synthetic, partial duplex having a Pstl sticky end at the 5' -terminus and a 10 10 
nucleotide long 5'-protruding region at its 3'-terminus. which latter region is complementary to the S'-terminal 
region of HSA 1, was included in the reaction mixture. The use of this "helper duplex" is shown in Scheme 11. 

0.1 u,g of Pstl-EcoRI cleaved pUC19 vector was mixed with 5 pmol of HSA 1+Ai, 5 pmol of 
S'-phosphorylated GTGCQATC and 5 pmol of 5'-phospharyIated TCTTCACCTAGATCGCACTGCA in 10 uJ 
reaction volume and the whole cloning procedure was performed essentially as described in the general 15 
procedure. 

100 colonies were checked by hybridization with 5'- 32 P-labeIed HSA 1 oligonucleotide as a probe. Of the 29 
positive clones, 10 were used for sequencing with the help of the pKO primer I and the reverse primer. 8 of the 
10 sequenced clones contained the correct sequence. Plasmid DNA of one of the proper clones was used in 
the next step as pHSA 1 for Apal-EcoRI double digestion and for cloning the HSA 2 oligonucleotide. 20 
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Cloning HSA 2 Into pHSA 1 

0.1 ng of Apal-EcoRI cleaved pHSA 1 was mixed with 5 pmol of HSA 2 + Ai in 10 jil reaction volume and the 
cloning steps were performed as described above. (Scheme 12). 5 

40 colonies were replica plated and hybridized with 5'- 32 P-HSA 2 oligonucleotide as a probe. 9 positive 
colonies were obtained, plasmid DNA prepared from them and they were sequenced using pKO primer I. 5 
clones contained the correct HSA 2 sequence. Plasmid DNA from one of the correct clones [pHSA (1-2)] was 
used in the next step to clone HSA 3 oligonucleotide. 
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Cloning HSA 3 into pHSA (1-2) 

0.1 \ig of Apal-EcoRI cleaved pHSA (1-2) was reacted with 5 pmol of HSA 3 + Ai in 10 \i\ reaction volume as 
described before (Scheme 13). 

187 colonies were replica plated and hybridized with 5'- 32 P-HSA 3 oligonucleotide probe. Of the 42 positive 
clones, 10 were used for preparing plasmid DNA and they were sequenced using pKO primer I. 1 clone was 
correct and the plasmid prepared from it was called pHSA (1-3). 

pHSA (1-3) was used later to clone HSA (4-6) DNA segment (see later). 
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Cloning HSA 4 into pUC19 

0.1 u.g of Xbal-EcoRI cleaved pUC19 and 5 pmol of HSA 4+Ai were reacted in 10 \i\ reaction volume as 
described before (Scheme 14) 5 

110 colonies were picked and 45 of them showed hybridization with 5'- 32 P-HSA 4 oligonucleotide probe. 
Ptasmid DNA was prepared from 4 positive clones, they were sequenced using pKO primer I. and all of them 
were found to contain the correct HSA 4 sequence as well as the expected flanking regions. The pHSA 4 
obtained so contained the regenerated Xbal site at the 5'-terminus of HSA 4, which could later be eliminated at 
the junction point between HSA 3 and HSA 4 oligonucleotides. 10 

pHSA 4 was used to clone HSA 5 in the next step. 
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Cloning HSA 5 into pHSA 4 

0.1 (xg of Apal-EcoRI cleaved pHSA 4 and 5 pmol of HSA 5 + Ai were reacted according to the general 
procedure (Scheme 15). 

65 colonies were hybridized with 5'- 32 P-HSA 5 probe and 3 of them proved to be positive. Plasmid DNA was 
prepared from the positive clones and one of them was correct, this was called pHSA (4-5) and was used to 
clone HSA 6 in the next step. 
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Cloning HSA 6 into pHSA (4-5) 

0.1 \lq of Apal-EcoRI cleaved pHSA (4-5) and 5 pmol of HSA 6-h Ai were reacted according to the general 
procedure (Scheme 16). 5 

225 colonies were replicated and hybridized with 5'- 32 P-HSA 6 oligonucleotide probe. 72 proved to be 
positive and 10 of the latter were used to prepare plasmid DNA. Of the 10 sequenced (using pKO primer I) 
plasmids 2 contained the correct HSA 6 sequence, one of these plasmids was called pHSA (4-6). 

pHSA (4-6) was used later to obtain HSA (4-6) DNA segment which was cloned into pHSA (1-3) to obtain 
pHSA (.1-6), or pHSA II. 10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



39 



EP 0 308 381 A1 



0) 
O 

to 











> 




CO 




<D 




r-H 




a 










in 


o 


I 


u 








M 


< 


kj 




Oi 









E-« 



a 
a 
u 

u cu 

o u 

*c £h 

o u 

u o 

< Eh 



< 
< 

U U 



a a 

o a 

o u 

o a 

o o 
u 

E-» 
U 
Eh 



^« 

4- 

0 
cn 
cd 
jh 
cu 

>. 

CP r-H 
-H O "-"I 
^H CU «-H 



cn 



to 



Q 
Eh 



5 
O 
C 
Q) 
i— i 



< 

a 



r-4 CN rn 



< 
cn 



a 

& 
u 

Eh 

a 
< 

< 



< 
CO 



Eh < 

Eh < 

< E- 

< Eh 



o o 
o o 
djo o 
o u._ 
u u 

^ < 
^ < 

£h < 



o u 

E^ < 
Eh < 



Eh 



< 



< Eh 

C3 a 

< Eh_ 

o u 

< Eh 

< Eh 

o u 

u o 

< Eh 

< e 



CO 



< 
cn 



40 



EP 0 308 381 A1 



Cloning HSA 7 into pUC19 

0.1 p,g of Xbal-EcoRI cleaved pUC19 and 5 pmol of HSA 7 + Ai were reacted according to the general 
procedure (Scheme 17). 5 

The HSA 7 containing clones were selected according to a color reaction. After transformation, the 
transformed cells were plated in the presence of IPTG (IPTG: isopropyl-p-D-thiogalactopyranoside) and X-gai 
(X-gal: 5-bromo-4-chloro-3-indoyl-[}-galactoside) [Vieira, J. and Messing, J. (1982) Gene 19, 259-268]. White 
colonies in blue background were expected to contain the correct HSA sequence. 

10 randomly picked white colonies were inoculated into LB-ampicillin medium and plasmid DNA prepared 10 
from them were sequenced using pKO primer 1. 7 of them contained the correct HSA sequence, one of them 
was used later as pHSA 7 to clone HSA 8 in the next step. 

HSA 7 oligonucleotide as the first oligonucleotide of HSA III fragment, contains an extra GGTAC 5'-terminal 
sequence, which was introduced in order to be able to use this sequence, forming a Kpnl site together with the 
next C residue, to join HSA 111 large fragment with HSA II large fragment. This extra sequence should disappear 15 
after performing the relevant reactions, so this sequence is not included as a part of HSA 7 when it is already 
cloned in pHSA 7 as shown in Scheme 17. 
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Cloning HSA 8 into pHSA 7 

0.1 u.g of Apal-EcoRI cleaved pHSA 7 was reacted with 5 pmol of HSA 8 + A1 according to the general 
procedure (Scheme 18). 

240 colonies were tested by hybridization probe 5'- 32 P-HSA 8 and 6 of them were postive. 2 of them 
revealed the correct HSA 8 sequence after sequencing with pKO primer I. Plasmid DNA of a correct clone was 
carried through the general cloning strategy as pHSA (7-8) to clone HSA 9 in the next step. 
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Cloning HSA 9 into pHSA (7-8) 

0.1 \ig of Apal-EcoRI cleaved pHSA (7-8) and 5 pmol of HSA 94- Ai were reacted according to the general 
procedure (Scheme 19). 

240 colonies were replica plated and hybridized with 5'- 32 P-HSA 9 oligonucleotide probe. Plasmid DNA was 
prepared of 8 of the 13 positive clones and sequenced. 3 of the 8 sequenced clones contained the correct 
pHSA (7-9) plasmid. 
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Cloning HSA 10 into pHSA (7-9) 

0.1 ug of Apal-EcoRI cleaved pHSA (7-9) and 5 pmol of HSA 10 + Ai were reacted in the usuaf way (Scheme 
20). 

Of 202 colonies 58 showed hybridization with 5'- 32 P-HSA 10 oligonucleotide probe. 10 positive clones were 
used to prepare plasmid DNA for sequencing and 8 of them contained the proper pHSA (7-10). 
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Cloning HSA 11 into pHSA (7-10) 

0.1 u.g of Apal-EcoRI cleaved pHSA (7-10) and 5 pmol of HSA 11+Ai were reacted in the usual way 
(Scheme 21). 5 

160 colonies were replica plated and 9 of them proved to be positive after hybridization with 5'- 32 P-HSA 1 1 
oligonucleotide probe. Plasmids prepared from the positive clones were sequenced by using pKO primer I. 
One clone was found to contain the expected sequence and plasmid DNA prepared from this clone was used 
in the next step as pHSA (7-11). 
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Cloning HSA 12 into pHSA (7-11) 

0.1 \ig of Apal-EcoRI cleaved pHSA (7-11) were reacted with 5 pmol of HSA 12 + Ai according to the general 
procedure (Scheme 22). 

240 clones were tested and 11 of them proved to be positive after hybridization with 5'- 32 P-HSA 12 
oligonucleotide probe. 2 of 10 sequenced (pKO primer I) plasmid DNA seemed to be correct and one of them 
was used later as pHSA (7-12) or pHSA III, i.e. the large HSA III fragment containing plasmid. 

The sequence of the HSA (7-12), or HSA III fragment was confirmed by sequencing in M13mp19 and mp18 
vectors. pHSA III was cleaved with Pstl and EcoRI, the small fragment was isolated and cloned into PStl-EcoRI 
cleaved M13mp18 and mp19 vectors. Single-stranded phage DNA was prepared from the recombinants and 
they were sequenced using the 17-mer primer. 
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Cloning HSA 13 into pUC19 

0.1 u,g of BamHI-EcoRI cleaved pUC19 and 5 pmol of HSA 13 + Ai were reacted according to the general 
procedure (Scheme 23). 5 

80 clones were tested by hybridization with 5'- 32 P-HSA 13 probe and 14 of them were found to be positive. 
Plasmid DNA was prepared from 10 positive clones and sequenced using pKO primer I. 6 of them were 
identical with the expected pHSA 13 plasmid. 

HSA 13 oligonucleotide, like HSA 7, contains the extra GGTAC 5' -terminal sequence, as this oligonuleotide 
is the first one in the HSA IV large fragment. In this case a Kpnl site was also formed, which can be used later to to 
join HSA III and HSA IV large fragments so that this extra sequence is eliminated at the joining point. 

pHSA 13 was used to clone HSA 14 in the next step. 
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Cloning HSA 14 into pHSA 13 

0.1 ng of Apal-EcoRI cleaved pHSA 13 was reacted with 5 pmol of HSA 14+Ai according to the general 
procedure (Scheme 24). 

2 of 160 clones tested for by hybridization with 5'- 32 P-HSA 14 were positive. After plasmid preparation and 
sequencing by pKO primer I, one of the two plasmid DNAs contained the expected sequence. It was called 
pHSA (13-14) and used in the next step. 
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Cloning HSA 15 into pHSA (13-14) 

When HSA 15 + Ai was tried to be cloned into Apal-EcoRl cleaved pHSA (13-14) according to the general 
procedure, a large number of colonies hybridizing with 5'- 32 P-HSA 15 were obtained, but after sequencing 5 
their plasmids, the expected HSA 15 sequence was never found. Instead, a double-mutated HSA 15 was 
obtained, in which G— *T mutation took place at nucleotide positions 1072 and 1096 (nucleotide positions in 
mature HSA gene sequence). These G residues were surrouded by T residues. The possibility that these 
apparent mutations were merely due to ambiguous gel reading which occurs sometimes using plasmid DNA 
template, was excluded after recloning the region of interest into M13mp19 phage vector. Since these two w 
mutations took place at the same time in ail cases (19 different positive clones were tested), we had to change 
the general cloning strategy in this case so that a complementary oligonucleotide covering the sites of 
mutations in HSA 15 was employed. 

A 42-mer complementary oligonucleotide was prepared and it was included into the ligation mixture of 
5'-phosphate HSA 15 and adapter 1. 50 pmol of 5'- 32 P-HSA 15 was mixed with 100 pmol of 5'-phosphate 15 
42-mer, 100 pmol of 5'-phosphate adapter 1 upper strand and 100 pmol of 5'-hydroxyl adapter 1 lower strand 
oligonucleotide. The partial duplex was isolated after gel electrophoresis as described previously in ~ 30<Vo 
yield based on HSA 15. This partial duplex is named as HSA 15 4- C + Ai in Scheme 25. 

Next, 0.1 jig of Apal-EcoRl cleaved pHSA (13-14) was reacted with 5 pmol of HSA 15 + C + Ai and the 
reactions were performed according to the general procedure. 340 clones were tested by 5'- 32 P-HSA 15 20 
probe and of the 17 positive clones 12 were used to prepare plasmid DNA. They were sequenced (pKO primer 
I) and 2 of them contained the expected HSA 15 sequence. This sequence was confirmed by recloning the 
HSA (13-15) region obtained so into M13mp19 phage vector and by performing the sequencing reactions on 
single-stranded DNA template. 

One of the proper plasmids was used as pHSA (13-15) in the next step. 25 
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Cloning HSA 16 into pHSA (13-15) 

When HSA 16 + Ai was cloned into Apal-EcoRI cleaved pHSA (13-15), ail the 16 sequenced positive clones 
had a 9 base pair deletion at the 5' -terminus of the HSA 16 region. Reexamination of the HSA 16 sequence 
revealed that its S'-terminal region and the 5' -terminal region of adapter 1 lower strand were nearly perfectly 
complementary. We planned to use a different Apal-EcoRI adapter lacking this complementary region, 
(adapter 2, see Scheme 3). HSA oligonucleotides were ligated with adapter 2 exactly in the same way as with 
adapter 1. 

0.1 jag of Apal-EcoRI cleaved pHSA (13-15) was reacted with 5 pmol of HSA 16 + A2 according to the general 
procedure. Of 60 clones 24 were found to be positive after hybridization with 5'- 32 P-HSA 16 probe. 10 positive 
clones were used to prepare plasmid DNA, they were sequenced with pKO primer I and two of them proved to 
be the expected pHSA (13-16). 

pHSA (13-16) was used later to prepare HSA (13-16) DNA region which was cloned into pHSA (17-18) to 
obtain pHSA (13-18), i.e. pHSA IV. 
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Cloning HSA 17 into pUC19 

0.1 \ig BamHI-EcoRl cleaved pUC19 was reacted with 5 pmol of HSA 17 + A2 in the usual way (Scheme 27). 

160 clones were tested by hybridization with 5'- 32 P-HSA 17. 30 of them were positive, and plasmids were 
prepared from 8 of them. 2 clones contained the expected pHSA 17 plasmid according to sequence data (pKO 
primer I). 
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Cloning HSA 18 into pHSA 17 

0.1 u.g of Apal-EcoRI cleaved pHSA 17 was reacted with 5 pmol of HSA I8 + A2 (Scheme 28). 

Of the 160 clones tested by hybridization with 5'- 32 P-HSA 18 probe, 24 were found to be positive. Plasmid 
DNA from 4 of the positive clones was prepared and sequenced using pKO primer I. 2 clones contained the 
correct pHSA (17-18) plasmid. 
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Cloning HSA 19 into pUC19 

0.1 \ig Xbal-EcoRI cleaved pUC19 was reacted with 5 pmol of HSA 19+Ai according to the general 
procedure. (Scheme 29). 5 

Transformed JM101 E. coli cells were plated onto LD-ampicillin plate in the presence of X-gai and IPTG as 
described for pHSA 7. Plasmid DNA from 6 randomly picked white colonies was prepared and sequenced 
using pKO primer I. 2 of them proved to be the correct pHSA 19. 

HSA 19, like HSA 7 and HSA 13, contains the extra GGTAC sequence at its 5'-terminus. This sequence, as 
described before, will facilitate joining HSA V large fragment with HSA IV (see also later). 10 
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Cloning HSA 20 into pHSA 19 

0.1 \Lg of Apal-EcoRI cleaved pHSA19 was reacted with 5 pmol of HSA 20 + Ai as described in the general 
procedure (Scheme 30). 

160 colonies were tested with 5'- 32 P-HSA 20 oligonucleotide probe and 58 of them were positive. Plasmid 
DNA prepared from 1 1 positive clones were sequenced using pKO primer I, and 8 of them contained the 
correct pHSA (19-20) plasmid. 
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Cloning HSA 21 into pHSA (19-20) 

0.1 ng of Apal-EcoRI cleaved pHSA (19-20) and 5 pmol of HSA 21 + Ai were reacted in the usual way 
(Scheme 31). 

33 of 240 clones were found to be positive after hybridization with 5'- 32 P-HSA 21 probe. 6 positiv clones 
were use3 to prepare plasmid DNA and after sequencing with pKO primer i, 3 clones were shown to contain 
the expected pHSA (19-21) plasmid. 
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Cloning HSA 22 into pHSA (19-21) 

0.1 p.g of Apal-EcoRI cleaved pHSA (19-21) and 5 pmol of HSA 224- At were reacted in the usual way 
(Scheme 32). 

80 clones were tested for hybridization with 5'- 32 P-HSA 22 probe t and 10 were found positive. Piasmid DNA 
prepared from them were sequenced using pKO primer I and only one proved to be correct. This is called 
pHSA (19-22). 
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Cloning HSA 23 into pHSA (19-22) 

0.1 p.g of ApaJ-EcoRI cleaved pHSA (19-22) and 5 pmoi of HSA 23 + Ai were reacted according to the 
general procedure (Scheme 33). 

160 clones were tested and 100 of them showed hybridization with 5'- 32 P-HSA 23 probe. Plasmid DNA was 
prepared from 6 positive clones and they were sequenced with pKO primer 1. 3 of them contained the correct 
HSA 23 sequence in the proper surrondings. One of them was used in the next step as pHSA (19-23). 
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Cloning HSA 24 into pHSA (19-23) 

0.1 u.g of Apal-EcoRI cleaved pHSA (19-23) was reacted with 5 pmol of HSA 24+Ai in the usual way 
(Scheme 34). 5 

Of 160 clones, 37 showed hybridization with 5'- 32 P-HSA 24 probe. Plasmid DNA was prepared from 3 
positive clones and they were sequenced with pKO primer I. 2 of the above plasmids contained the correct 
HSA 24 sequence, and one of them was used as pHSA (19-24) or pHSA V later. 

The sequence of HSA V was confirmed after its recloning as a Pstl-EcoRi fragment obtained from pHSA V 
into Pstl-EcoRI cleaved M13mp18 and mp19 vector pair. 10 
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JOINING HSA LARGE FRAGMENTS 

Although the HSA gene was planned to be assembled from 5 large fragments (HSA I, II, III, IV and V), up to 
now only HSA III and HSA V syntheses were demonstrated. HSA I is a flexible 5' -terminal region of HSA and it 5 
was chemically synthesized as a relatively short Pstl-Sau3AI segment (see Scheme 4). HSA II and HSA IV were 
obtained from two DNA segments, i.e. HSA II or HSA (1-6) was obtained from HSA (1-3) and HSA (4-6), while 
HSA IV, or HSA (13-18) was obtained from HSA (13-16) and HSA (17-18). During the assembly of HSA II and 
HSA IV large fragments, reaction series like restriction digestion followed either by mung bean nuclease 
treatment or Klenow polymerase 4- dNTP treatment were employed. These reaction conditions are fully 10 
described for obtaining pHSA It and pHSA IV, and similar reactions, when manipulating with HSA large 
fragments, will only be referred. 

Mung bean nuclease and Klenow polymerase 4-dNTP treatment are used to remove the 5'- or 
3'-overhanging single stranded DNA regions obtained after restriction enzyme cleavage to produce blunt 
ends. Mung bean nuclease removes 5'-protruding ends, while Klenow polymerase + dNTP treatment removes 15 
the 3'-protruding ends. The latter treatment, at the same time, fills in the 5'-protruding end to yield blunt end. 

pHSA II 

HSA II large fragment, cloned in pUC19 as pHSA II, was obtained from HSA (1-3) and HSA (4-6) DNA 20 
segments so that pHSA (1-3) was used as a vector to clone HSA (4-6). 

1 jig of pHSA (4-6) was treated with 10 units of Xbal in 50 u.l of reaction volume containing 100 mM NaCI, 50 
mM Tris-HCI, pH 7.5, 10 mM MgCb, 1 mM DTT (high salt buffer) at 37° C for 1 hr. Linear vector DNA obtained so 
was ethanol precipitated, dried and dissolved in 50 \l\ of mung bean nuclease buffer (30 mM sodium acetate, 
pH 5.0, 100 mM NaCI, 2 mM ZnCfc. 10% glycerol, 0.5 mg/mi denatured calf thymus DNA) and treated with 1 u.l 25 
of 10 U/uJ mung bean nuclease at 37° C for 30 min. The reaction mixture was phenol extracted, then the DNA 
was ethanol precipitated. The pellet was dissolved in 50 \i\ of high salt buffer (see before for Xbal treatment) 
and 20 units of EcoRI was added to the reaction mixture which was kept at 37° C for 1 hr. After ethanol 
precipitation, the small HSA (4-6) fragment was isolated by electrophoresis on a 2% agarose gel (in TAE 
buffer) followed by electroelution and ethanol precipitation. This fragment has a blunt end at the 5'-terminus 30 
and an EcoRI sticky end at the 3'-terminus (Scheme 35). 

1 |ig of pHSA (1-3) was dissolved in 50 uJ of low salt buffer containing 6 mM NaCI, 6 mM Tris-HCI, pH 7.4, 6 
mM MgCIa and 1 mM DTT and treated with 10 units of Apal at 37°C for 1 hr. After ethanol precipitation, the 
pellet was dissolved in 50 u.l of Klenow buffer containing 7mM Tris-HCI, pH 7.5, 7 mM MgCl2, 5 mM 
P-mercaptoethanoI, 0.1 mM EDTA and 0.1 mM dNTP and treated with 0.5 pj of 5 U/uJ Klenow polymerase at 35 
room temperature for 10 min. After phenol extraction and ethanol precipitation, the pellet was dissolved in 50 
u.I of high salt buffer and 10 units of EcoRI was added. The reaction mixture was kept at 37° C for 2 hrs, then the 
DNA was ethanol precipitated. Large vector fragment having a blunt-end and an EcoRI end was isolated by 
electrophoresis on 0.5% agarose gel in TAE buffer followed by electroelution and ethanol precipitation. 
(Scheme 35). 40 

The cleaved pHSA (1-3) vector (0.1 jxg) was ligated with HSA (4-6) fragment (approx. 0.03 jig) in 10 uJ of 
reaction volume containing 50 mM Tris-HCI, pH 7.5, 10 mM MgCb, 10 mM DTT, 1 mM ATP (ligase buffer) and 80 
units of T4 DNA ligase was added at 1 5° C for 12 hrs. The reaction mixture was transformed into frozen compe 
tent JM101 cells and they were then plated onto LB-ampiciiHn plates. Of 110 replica-plated colonies, 55 
showed hybridization with 5'- 32 P-HSA 4 oligonucleotide probe. Plasmid DNA was prepared from 10 positive 45 
clones and they were sequenced by using the reverse primer. 2 of them showed the expected sequence at the 
junction of HSA 3 and HSA 4 oligonucleotides, and they were used later as pHSA (1-6) or pHSA II (Scheme 35). 

(The sequence of HSA II was confirmed after its subcloning into Pstl-EcoRI cleaved M13mp18 and mpl9 
phage vector, and the sequencing reactions were performed on single-stranded DNA template). 
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pHSA IV 

HSA IV large fragment was obtained from the previously cloned HSA (13-16) and HSA (17-18) DNA 
segments so that pHSA (17-18) vector was used to clone HSA (13-16) (Scheme 36). 5 

1 ng of pHSA (13-16) was treated with 20 units of Apal in 50 uJ of low salt buffer for 1 hr at 37°C. The DNA 
was ethanol precipitated and dissolved in 50 uJ of Klenow buffer containing 0.1 mM dNTP and treated with 2.5 
units of Klenow polymerase at room temperature for 10 min. The reaction mixture was phenol extracted and 
ethanol precipitated, and the pellet was dissolved in 50 \i\ of high salt buffer containing 20 units of Pstl at 37° C 
for 1 hr. After ethanol precipitation the small fragment was isolated by electrophoresis on a 2% agarose gel to 
followed by electrocution. This procedure yielded HSA (13-16) DNA segment having a blunt end and a Pstl 
sticky end. (Scheme 36). 

1 u.g of pHSA (17-18) was cleaved with 10 units of BamHI in 50 uJ reaction volume containing high salt buffer 
at 37° C for 1 hr. DNA was ethanol precipitated and dissolved in 50 uJ of mung bean nuclease buffer then 10 
units of mung bean nuclease was added at 37° C for 30 min. After phenol extraction and ethanol precipitation 15 
the pellet was dissolved in 50 uJ of high salt buffer and 20 units of Pstl was added for 1 hr at 37° C. The large 
linear vector fragment was ethanol precipitated, purified by electrophoresis on a 0.5<Vo agarose gel (TAE 
buffer) followed by electroelution. This reaction series resulted in cleaved pHSA (17-18) vector having a 
blunt-end and a Pstl sticky end. (Scheme 36). 

The cleaved pHSA (17-18) vector (0.1 was ligated with HSA (13-16) (approx. 0.05 u.g) in 10 uJ of ligase 20 
buffer containing 80 units of T4 DNA ligase at 15° C for 12 hrs. The mixture was then transformed into frozen 
competent JM101 cells and plated onto LB-ampicillin plates. 230 colonies were tested by hybridization with 
5'.32p-HSA 16 probe and 86 were found to be positive. Plasmid DNA was prepared from 10 clones and they 
were sequenced by using pKO primer 1. 5 plasmid DNA showed the correct junction between HSA 16 and HSA 
17 oligonucleotide regions, and they were used later as pHSA (13-18), or pHSA IV (Scheme 36). 25 

(The sequence of HSA IV was confirmed after its subclontng into phage vector M13mp18 and mp19. 
Sequencing was performed on single-stranded DNA template). 
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pHSA (ll-lll) 

In this case pHSA III was used as a vector to clone HSA III fragment (Scheme 37). 
HSA III fragment preparation 

5 \ig of pHSA IN was treated with 40 units of Kpnl in 100 ui of low salt buffer at 37° C for 3 hrs. The cleaved 
vector was ethanol precipitated and the pellet was dissolved in 50 u.l of Klenow buffer containing 0.1 mM dNTP 
and 2.5 units of Klenow polymerase and the mixture was kept at room temperature for 10 min. After phenol 
extraction and ethanol precipitation, the pellet was dissolved in 50 uJ of high salt buffer, 40 units of EcoRI was 
added and the mixture was kept at 37° C for 2 hrs. The DNA was ethanot precipitated and the HSA III fragment 
was isolated by electrophoresis on a 2<>/o agarose gel in TAE buffer followed by electroelution. The HSA III 
containing large fragment obtained so has" a blunt-end and an EcoRI sticky end (Scheme 37). 

pHSA II vector cleavage 

1 \lq of pHSA II was dissolved in 50 uJ of low salt buffer and treated with 10 units of Apal at 37° C for 2 hrs. 
After ethanol precipitation, the pellet was dissolved in 50 uJ of Klenow buffer containing 0.1 mM dNTP and 
treated with 2.5 units of Klenow polymerase at room temperature for 10 min. The mixture was phenol 
extracted, the DNA was ethanol precipitated and the pellet was dissolved in 50 jxl of high salt buffer, then 20 
units of EcoRI was added. The reaction mixture was kept at 37° G for 2 hrs and the DNA was ethanol 
precipitated. Large vector fragment was isolated by electrophoresis on a 0.5<Vo agarose gel in TAE buffer 
followed by electroelution. The linear vector obtained so has a blunt-end and an EcoRI sticky end (Scheme 37). 

Ligation 

0.2 \ig of cleaved pHSA II vector was mixed with 0.1 u.g of HSA HE fragment in 10 u,l of ligase buffer and 80 
units of T4 DNA ligase was added. The reaction mixture was kept at 15° C for 14 hrs and then it was 
transformed into JM101 E. coll cells. Approx. 50°/o of the ampicilfin resistant colonies showed hybridization 
with 5'- 32 P-HSA 11 oligonucleotide probe. Plasmid DNAS prepared from 8 positive clones were sequenced 
using a synthetic primer complementary with a part of HSA oligonucleotide (between nucleotide positions 
508-527 of the mature HSA gene), and air 8 showed the proper sequence at the junction point of HSA II and 
HSA III large fragments. 
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pHSA (iV-V) 

In this case pHSA V served as a vector to clone HSA IV fragment (Scheme 38). 

5 

HSA IV fragment preparation 

2 u.g of pHSA IV was treated with 10 units of Apal in 50 jii of low salt buffer at 37° C for 2 hrs. The linear vector 
was ethanol precipitated and the pellet was dissolved in 50 uJ of Klenow buffer containing 0.1 mM dNTP and 2.5 
units of Klenow polymerase was added at room temperature for 10 min. After phenol extraction and ethanol 10 
precipitation, the pellet was dissolved in 50 uJ of high salt buffer, 40 units of Pstl was added and the mixture 
was kept at 37° C for 2 hrs. After ethanol precipitation, the small fragment containing HSA IV sequence was 
purified by electrophoresis on a 2<>/o agarose gel in TAE buffer followed by electroelution. The small fragment 
has a Pstl sticky end and a blunt-end. (Scheme 38). 



pHSA V vector cleavage 



Ligation 



15 



2 ng of pHSA V was treated with 10 units of Kpnl in 50 u.l of low salt buffer at 37 D C for 4 hrs. After ethanol 
precipitation, the pellet was dissolved in 50 u.l of Klenow buffer containing 0.1 mM dNTP and 2.5 units of 
Klenow polymerase and was kept at room temperature for 10 min. After phenol extraction and ethanol 20 
precipitation, the pellet was dissolved in 50 uJ of high salt buffer and 40 units of Pstl was added. The mixture 
was kept at 37° C for 4 hrs. After ethanol precipitation, the linear vector was purified by electrophoresis on a 
0.50/0 agarose gel in TAE buffer followed by electroelution. The cleaved pHSA V vector obtained so has a Pstl 
sticky end and a blunt end (Scheme 38). 



25 



Approx. 0.1 u.g linearized pHSA V vector and 0.05 \vg of HSA IV containing fragment was treated with 80 units 
of T4 DNA ligase in 10 uJ of ligase buffer at 15°C for 4 hrs. After transformation into JM101 E. coli cells, the 
ampicillin resistant colonies were tested with 5'- 32 P-HSA 16 oligonucleotide probe and approx. 40<>/o of them 30 
were positive. 8 colonies were used to prepare plasmid DNA, they were sequenced with a synthetic primer 
complementary with a part of HSA 19 oligonucleotide (nucleotide positions 1374-1393 in the mature HSA 
gene) and 7 of them had the correct sequence at the junction point between HSA IV and HSA V regions. 
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pHSA (IV-V) with Apal-Sacl-EcoRI adapter [pHSA (IV-V) ASE] 

pHSA (IV-V) obtained as before contains adapter 1 downstream of the HSA coding region. Cloning of the 
HSA gene into the E. coli part (pPT2HKi) of the E.coli-yeast shuttle vector requires a downstream Sacl site and 
so this site has to be introduced somehow. It seems to be advantageous to introduce it at this stage of the 
gene assembly. The most obvious way to have a Sacl site seems to be the replacement of the Apal-EcoRI 
adapter 1 with a similar adapter having an internal Sacl site (adapter 3, see Scheme 3). 

HSA (IV-V) region was isolated from pHSA (IV-V) as a Pstl-Apal fragment and it was cloned, together with 
adapter 3 (Apal-Sacl-EcoRI adapter) into Pstl-EcoRI cleaved pUC19. 

HSA (IV-V) fragment isolation: 

2 u.g of pHSA (IV-V) was treated with 20 units of Apaf in 50 u.1 of low salt buffer at 37° C for 3 hrs. After ethanol 
precipitation, the pellet was dissolved in 50 uJ of high salt buffer and 20 units of Pstl was added. The reaction 
mixture was kept at 37° C for 4 hrs. The HSA (IV-V) fragment was purified by gel electrophoresis on a 2% 
agarose gel in TAE buffer followed by electroelution. 

Ligation: (Scheme 39) 

0.1 u.g of PStl-EcoRI cleaved pUC19 was mixed with 0.05 u,g of Apal-EcoRI HSA (IV-V) fragment and with 5-5 
pmol of 5'-phosphorylated adapter 3 oligonucleotides in 20 uJ of ligase buffer. 80 units of T4 DNA ligase was 
added and the mixture was kept at 15°C for 14 hrs. After transformation, ampicillin resistant colonies were 
screened on two different replica plates with either 5'- 32 P-HSA 16 oligonucleotide probe or 5'- 32 P-adapter 3 
lower strand oligonucleotide probe. 

Approx. 50% of the colonies showed hybridization with both probes. Positive colonies were used to prepare 
plasmid DNA and sequencing was performed by both the reverse primer and the pKO primer I. All the 10 
clones checked by sequencing proved to be correct. 

In the following, this pHSA (IV-V), which is supplied with a downstream Sacl site by introducing adapter 3, is 
used for the further steps of the HSA gene assembly. 
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pHSA (li-V) 

In this case, pHSA (ll-HI) served as a vector to clone HSA (IV-V) fragment (Scheme 40). 

5 

pHSA (ll-IH) vector cleavage : 

2 u.g of pHSA (II-III) was treated with 40 units of Apal in 50 uJ of low salt buffer at 37° C for 5 hrs. After ethanol 
precipitation, the pellet was dissolved in 50 \i\ of Klenow buffer containing 0.1 mM dNTP and 2.5 units of 
Klenow polymerase was added. The mixture was kept at room temperature for 10 min, then it was pheno! 10 
extracted and ethanol precipitated. The pellet was dissolved in 50 jil of high salt buffer and 20 units of EcoRI 
was added. The mixture was kept at 37° C for 5 hrs then the linear vector was isolated by electrophoresis on a 
0.5<Vo agarose gel in TAE buffer followed by electroelution. 

HSA (IV-V) fragment isolation: /5 

2 u.g of pHSA (IV-V) was dissolved in 50 uJ of low salt buffer and treated with 20 units of Kpnl at 37° C for 5 
hrs. After ethanol precipitation the pellet was dissolved in Klenow buffer containing 0.1 mM dNTP and 2.5 units 
of Klenow polymerase. The mixture was kept at room temperature for 10 min, then the DNA was ethanol 
precipitated and dissolved in 50 u.l of high salt buffer and 20 units of EcoRI was added. The reaction mixture 20 
was kept at 37° C for 5 hrs, then the small fragment containing HSA (iV-V) region was isolated by 
electrophoresis on a 2<Vb agarose gel in TAE buffer followed by electroelution. 

Ligation: 

25 

Approx. 0.1 jxg of linearized pHSA (II-III) vector and 0.05 u,g of HSA (IV-V) containing fragment were mixed in 
10 jl! of ligase buffer containing 80 units of T4 DNA ligase and the mixture was kept at 15°C for 7 hrs. After 
transformation into JM101 E. coli cells, ampicillin resistant colonies were tested by hybridization with 
5'- 32 P-HSA 21 oligonucleotide probe and approx. 40% of the colonies proved to be positive. 8 'colonies were 
used to prepare plasmid DNA, they were sequenced using 5'- 32 P-HSA 11 oligonucleotide as a sequencing 30 
primer. All 8 had the proper joining point between HSA III and HSA IV regions. 

The whole HSA (II-V) containing region of pHSA (ll-V) was checked by sequencing on plasmid template 
(pKO primer l and HSA 1-8 primers were used) and no mistake was found. 
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pHSA vectors (No 1, No 2) 

HSA (ll-V) fragment was supplemented with HSA I fragment by cloning these two fragments into pUC19 
vector. (Scheme 41). 5 

HSA (ll-V) could have been isolated as Sau3AI-EcoRI fragment directly as there is a unique Sau3AI site in the 
gene. The pUC1 9 vector part, however, contains many Sau3AI sites, complicating the restriction digestion and 
fragment separation. First, HSA (ll-V) was isolated in a Hindlll-EcoRI fragment, which was shortened further by 
Sau3AI treatment. 

5 u.g of HSA (ll-V) was treated in 100 uJ of high salt buffer with 40 units of Hindlll and 40 units of EcoRl at 10 
37° C for 3 hrs. The mixture was applied onto a 0.5% agarose gel (TAE buffer) and after electrophoresis, two 
fragments were obtained. The smaller fragment was electroeluted and ethanol precipitated. 

The pellet was dissolved in 50 uJ of high salt buffer and treated with 7.5 units of Sau3AI at 37° C for 14 hrs. 
The reaction mixture was phenol extracted (2x), ethanol precipitated, so the large Sau3AI-EcoRI fragment was 
not purified by gel electrophoresis in this case. is 

Two separate ligations were set up, each containing the Pstl-EcoRI cleaved pUC19 cloning vector and the 
Sau3AI-EcoRI HSA (ll-V) fragment, and one of the two HSA I fragments (as a mixture of two oligonucleotides 
forming a Pstl-Sau3AI adapter). 

0.2 p.g of Pstl-EcoRI cleaved pUC19 and 0.1 \ig of Sau3AI-EcoRI HSA (ll-V) fragment were mixed with 5-5 
pmoles of 5'-phosphorylated HSA I No 1, or HSA I No 2, (Scheme 4) in two separate reaction mixtures 20 
containing 10 uJ of ligase buffer. 80 units of T4 DNA ligase was added to both reaction mixtures, they were kept 
at 15°C for 6 hrs, then transformed into JM101 E. coli cells. The transformed mixtures were plated onto 
LB-ampicillin plates. Colonies were double-replicated onto 2 nitrocellulose filters and they were hybridized 
with 5'- 32 P-HSA 5 oligonucleotide probe (first filter) and with the corresponding 5'- 32 P-HSA I oligonucleotide 
probe (second filter). Approx. 80% of the colonies showed hybridization with both probes in both cases. 25 
Plasmid DNA was prepared from 4-4 clones of the two constructions and they were sequenced using the 
reverse primer to check the proper insertion of HSA I versions into pHSA vectors. All sequenced 
constructions were correct 

The whole HSA coding regions from pHSA No 1 and No 2 were subcloned as Pstl-EcoRI fragments into 
M13mp18 and mp19 phage vectors and the whole sequence was checked in mp19 using the 17-mer primer 30 
and HSA 1-9 primer. The mp18 constructions were checked only with the 17-mer primer. 
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The construction of the E. coli plasmid carrying the yeast promoter and terminator sequences 

1. The starting cloning vector {PGB1, Fig 1) was obtained by modification of pBR327 plasmid (Soberon, X., 
Covarrubias, L and Bolivar, F. (1980): Gene 9, 287-305.) from which the Pstl and Hindll sites from the Ap n 
region were eliminated by EMS and HA mutagenesis and repeated restriction enzyme digestion. Conditions of 
the mutagenesis were the same as described in Miller, J.H.: Experiments in Molecular Genetics. Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y. 

The Xhol site was introduced as a CCTCGAGG linker inserted at the unique (filled-in) Aval site. 

2. The plasmid pGB2 (HIS3): 

The 1327 bp BamHI-Xhol fragment containing the entire cloned HIS3 gene of Saccharomyces cerevisiae 
(Storms, R.K., McNeil, J.B., Khandekar, P.S., An, G., Parker, J., and Friesen, J.D. (1979) J. Bacteriol. 40, 73-82; 
and Struhl, K. (1985): Nucleic Acids Res. 13, 8587-8601) was excised from pYF 92 (Storms, et al. ibid) 
(obtained from Gyorgy B. Kiss, Institute of Genetics, Biological Research Center of the Hungarian Academy of 
Sciences, Szeged, Hungary) and inserted at the unique BamHI and Xhol sites of pGB1, resulting in pGB2 
(H1S3) (Fig 2). 

3. The plasmid pGB3-229Tcontaining the transcriptional terminator region of the yeast His3 gene: 
The EcoRI-Kpnl fragment of pGB2 (HIS3) was replaced by the 1327 bp Tc H cartridge (EcoRI-Kpnl) from the 

plasmid pJRD 158 (Davison, J., Heustersprute, M„ Merchez, M. f and Brunei, E (1984): Gene 28, 311-318) 
[obtained from John Davison (Unit of Molecular Biology, International Institute of Cellular and Molecular 
Pathology, 75, Avenue Hippocrate, B-1200, Brussels, Belgium)]. The pGB3-229T - besides the Ap R + ori 
cartridge - carries the entire Tc R gene (with an additional Sacl site at its 3'-end) and the transcriptional 
terminator region of the HIS3 gene. (Fig 3a). The pGB-229T was further modified by 1) deletion of Kpnl site in 
pGB-229T to obtain pGB3-229TK° (Fig. 3b) and 2) insertion of the Hindlll-Sall promoter fragment from 
pUC18V622PH (of Fig. 5) resulting in pPT2HKi (Fig. 3c), 

4. Cloning of the promoter region of the PH05 gene of Saccharomyces cerevisiae. 

The PH05 gene encodes a repressive acid phosphatase exoenzyme (orthophosphoric - monoester 
phosphahydrolase (acid optimum), EC 3.1.3.2.). It is a part of a 8 kb. EcoRI genomic DNA fragment (Kramer, 
R.A., Andersen, N. (1980): Proc. Natl. Acad. Sci. USA 77, 6541-6545; and Rogers, D.T., Lemire, J.M., and 
Bostian, KA (1982): Proc. Natl. Acad. Sci. USA, 79, 2157-2161). 

To obtain the plasmid carrying the PH05 gene (Davison et. al. ibid) a yeast gene bank (a cosmid library 
constructed from the genomic DNA of S. cerevisiae, obtained from Z. Feher, Debrecen Medical University, 
Debrecen. Hungary) was screened as follows: a mixture of the recombinant cosmid DNA was digested with 
EcoRI. 8 kb EcoRI fragments were isolated from agarose gels, and recloned at the EcoRI site of the plasmid 
pGB2 (HIS3). The PH05-gene containing plasmid (pGB2 (HIS3, PH05, PH03) (Fig 4) was then selected on the 
basis of the complemetation of the pho5 mutation in the yeast strains DB-4 (Rogers et. al. ibid.) and AH220 (a, 
trpl, leu2-3, 2-112, his3-11, 3-15, pho5, pho3) provided by A, Hinnen, CIBA-GEIGY, Basel; see Tait-Kamradt, 
A.G., Turner, K.J., Kramer, RA, Elliott, Q.D., Bostian, S.J., Thill, G.P., Rogers, D.T., and Bostian. K. (1986): 
Molec. and Cell. Biol. 6, 1855-1865). 

5. Subcloning of the PH05 gene promoter region. 

The promoter of the repressible acidic phosphatase gene (PH05) can be excised from the plasmid pGB2 
(HIS3, PH05, PH03) by BamHU-Sall restriction enzyme digestion as a 623 bp fragment (Meyhack, B., Bajwa, 
W., Rudolph. H., and Hinnen, A. (1982): EMBO J. 1, 675-680). The latter was recloned in pUC16 at BamHI-Sall 
sites resulting in the plasmid pUC18/623P (Fig. 5a) in which the insert's sequence was verified by sequencing 
and compared to that from published literature (Meyhack, et. al. ibid; and Arima, K., Oshima, T. ( Kubota, I., 
Nakamura, N., Mizunaga, T., and Toh-e, A (1983): Nucleic Acids Res. 11, 1657-1672). 

The BamHI-Sall (623 bp) fragment in pUC18/623P plasmid contains the PH05 upstream activating 
sequences and part of the coding sequence (encoding the N-terminal 17-amino-acid secretion signal peptide 
and 10 more amino acids from the N-end of the mature gene product). 

The primary structure of the secretiorvsignal coding region of the PH05 gene: 
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Met Phe Lys Ser Val Val Tyr Ser He Leu 
^PROMOTER - ATG TTT AAA TCT GTT GTT TAT TCA ATT TTA 

BamHI 



10 



15 



Ala Ala Ser Leu Ala Asn Ala Gly Thr 
GCC GCT TCT T TG GCC A AT GCA GGT ACC 



Ball 



Konl 



Signal end 



20 



25 



30 



35 



In this structure the Kpnl site located downstream from the "signal end" codon Ala could be used as a 
cloning site (made blunt end by Kpnl followed by trimming the 3'-protruding sequence) for the HSA coding 
gene if it were shifted by one base into the 5'-direction. 

In pUC18/623P the above mentioned Kpnl site could not be manipulated unless the upstream Kpnl site (x, 
fig 5a) had been deleted from the plasmid. The plasmid therefore was cleaved with Sac! and BamHI, followed 
by creating blunt ends by removing the protruding 3'-terminal nucleotides from the Sacl end and filling-in the 
BamHI end with DNA polymerase I Klenow fragment and nucleoside triphosphates (step 1. in Fig 5). Following 
religation and transformation, resulting in the plasmid pUC18 7623P (Fig 5b), the BamHI site was restored and 
the Kpnl site downstream from the "signal end" codon became unique, thus suitable for further manipulations 
and in vitro mutagenesis (see below). 

6. In vitro mutagenesis of the "signal end" site: a one-base shift of the Kpnl site in order to create a splice 
site being "in-phase" with the "signal end" codon (Ala). 

To be able to ligate the 5'-blunt end of the HSA gene with the PH05 signal coding sequence in the correct 
phase, the Kpnl site has to be shifted by one base into the S'-direction. It was noticed that the deletion of the 
adenosine residue (A) upstream from the Kpnl site did not result in any changes in the nature of the encoded 
amino acids within the signal sequence 
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45 



50 



Ser Leu Ala Asn Ala Gly Thr 
►TCT T TG GCC AA T GCA GGT ACC A. 
Ball >KKpnI 



Signal end 



Ser Leu Ala Asn Ala Val 
.TCT TTG GCC AAT GCG GTA CCA . 



Ball 



Kpnl 



55 



60 



Cleaving the modified sequence with Kpnl and then removing the protruding GTAC-3' nucleotides with DNA 
polymerase I Klenow fragment + dNTP generates a blunt end at which the Kpnl site becomes in an exact 
coincidence with the position of the "signal end" codon (GCG). 

To achieve the above mentioned structural change, 



the CCAATGCAGGTAC 
GGTTACGTC 



fragment of the pUCl8 /623P 
located between 
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Ball and Kpni sites was replaced by a synthetic linker 

CCAATGCGGTAC resulting in plasmid 

GGTTACGC pUC18 /622P. The replacement 

was verified by sequencing (step 2. in Fig. 5). 

For further cloning purposes the EcoRI site (upstream from the PH05 promoter) was also replaced by a new 
Hindlll site by Inserting a Hindlll linker (CAAGCTTG) at the filled-in EcoRI site (step 3. in Fig. 5). This new 
construction was called pUC18V622PH (Fig 5c). 

7. Construction of the plasmid pPT2HK1 containing the yeast expression cassette: 

The plasmid pGB3-229T (Fig 3) contains a Sacl (Sstl) and a Kpnl site downstream from the Tc n -gene. By 
inserting the PH05 promoter region (at the unique Hindlll and Sail sites from pUC18V622PH) the Kpnl site 15 
from pGB3-229T would become superfluous, thus the Kpnl site was deleted from pGB3-229T by Kpnl 
digestion and removal of the protruding 3y end by the Klenow polymerase + dNTP, religation of the blunt ends 
and transformation. The new plasmid (pGB3-229TK°) (Fig 3), lacking the Kpnl site) was cleaved with Hindlll 
and Sail and the Hindlll-Sall fragment of pUC18 7622PH (containing the modified PH05 promoter and signal 
sequence) was cloned in, thus creating a tetracycline sensitive plasmid pPT2HK1 (Fig 3 and 6) which carries a 20 
functional yeast expression casette consisting of the in vitro mutagenized PH05 promoter and signal-coding 
region and the transcriptional terminator of the HIS3 gene. 

8. The construction of the E. coli-yeast shuttle vector plasmid pBY200. 
The major points of consideration are: 

25 

- to utilize the useful properties of the "classical " E. coli-S. cerevisiae shuttle vector plasmid pJDB207 (Beggs, 
J.D. (1981 ) : Multiple-copy yeast plasmid vectors. Von Wettstein, D., Friis, J M Kielland-Bradt, M., and Stenderup, 
A. (eds) Molecular genetics in Yeast. Alfred Benzon Symposium Vol. 16, 83-390), i.e. 1) relatively small size in 
comparison with many other yeast cloning vectors; 2) high-copy-number replication of the plasmid in yeast 

host cells; 3) stable selection of the plasmid-containing yeast cells (of leu 2 phenotype) due to the presence of $q 
the LEU2 selective marker gene, giving 4) also possibility of direct selection in leuB E. coli hosts; 

- to contain suitable restriction enzyme recognition sites that make it compatible with the E. coli plasmid 
pPT2HK1 carrying the yeast expression cassette (see above) and its recombinant derivatives. 

The plasmid pBY 200 was constructed by two steps of cloning (Fig 7) : 

1. Insertion of the "LEU2 + 2 u, on" cartridge (a 3.4 kb EcoRI fragment obtained by partial EcoRI 35 
digestion of pJDB 207 (Beggs et. al., ibid), into the EcoRI site of pGB1 ; 

2. Filling-in with DNA polymerase Klenow fragment (followed by religation of the blunt ends) of the Xba I 
site in the "2 u, ori° region. This modification had no effect on the ability of the plasmid to replicate in S. 
cerevisiae. 



Cloning the HSA genes (No 1, No 2) into pPT2HKi E . coli vector 



40 



pPT2HKi E. coli vector is shown in Fig. 3 and 6 and its modified signal sequence region is described above. 

Its main features from the point of view of HSA gene cloning are that it contains the yeast PHOS promoter 
and the PHOS signal sequence as well as a yeast transcription terminator (HIS3). The promoter-signal 45 
sequence and the terminator regions are separated by unique restriction sites so that the HSA coding gene 
segment (structural HSA gene) can be inserted between these two regions. 

In the pPT2HKi vector the restriction sites used to insert the HSA gene are Kpnl and Sacl sites. The Kpnl 
site at the end of the signal sequence (leader peptide coding region) was previously shifted by us so that after 
Kpnl cleavage followed by trimming of the resulting 3'-protruding region a blunt end was formed and this blunt 
end coincides exactly with the end of the leader peptide coding region (Scheme 42). The Sac I site is located 
upstream of the HIS3 termination region and the Sac I cleavage is performed after the Kpn I cleavage and blunt 
end formation. 

pPT2HKi cleavage: 55 

2 u,g of pPT2HKi was treated with 20 units of Kpn I In 50 uJ of low salt buffer at 37° C for 2 hrs. After ethanol 
precipitation, the pellet was dissolved in 50 uJ of Klenow buffer containing 0.1 mM dNTP and 2.5 units of 
Klenow polymerase and the reaction mixture was kept at room temperature for 10 min. The reaction mixture 
was phenol extracted and ethanol precipitated. The pellet was dissolved in 50 u.1 of low salt buffer and 20 units so 
of Saci was added followed by incubation at 37° C for 5 hrs. After ethanol precipitation, the large vector 
fragment was isolated by electrophoresis on a 0.5% agarose gel in TAE buffer followed by electroelutlon. 
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pHSA No 1 cleavage to obtain HSA No 1 fragment: 

2 ug of pHSA No 1 dissolved in 50 ul of high salt buffer was treated with 20 units of Pstl at 37° C for 2 hrs. 
After ethanol precipitation, the pellet was dissolved in 50 ul of Klenow buffer containing 0.1 mM dNTP and 2.5 
5 units of Klenow polymerase and the reaction mixture was kept at room temperature for 10 min. The reaction 
mixture was phenol extracted and ethanol precipitated. The pellet was dissolved in 50 ul of low salt buffer and 
20 units of Sad was added. The mixture was kept at 37° C for 5 hrs, then applied onto a 0.5% agarose gel (TAE 
buffer). The smaller fragment was isolated after electrophoresis followed by electroelution, 

10 pHSA No 2 cleavage to obtain HSA No 2 fragment: 

pHSA No 2 was reisolated from a dam ( ' J E. coli strain in order to be able to work with Bell enzyme, which is 
sensitive to adenine methylation. 

2 jxg of pHSA No 2 dissolved in 50 ul of buffer containing 75 mM KCI, 6 mM Tris-HCI, pH 7.4, 10 mM MgCk 
15 and 1 mM DTT was treated with 20 units of Bell at 50° C for 5 hrs. After ethanol precipitation, the pellet was 
dissolved in 50 ul of mung bean nuclease buffer and 10 units of mung bean nuclease was added at 37° C for 30 
min. After phenol extraction and ethanol precipitation, the pellet was dissolved in 50 ul of low salt buffer and 20 
units of Sac I was added. 

The mixture was kept at 37° C for 5 hrs. The smaller fragment containing the HSA No 2 was isolated as it was 
20 described for HSA No 1. 

Ligations: 

0.1 ug of cleaved pPT2HKi and 0.2 ug of HSA No 1 or HSA No 2 fragment was mixed in 10 ul of ligase buffer 
25 and 80 units of T4 DNA ligase was added. The reaction mixtures were kept at 15°C for 15 hrs, then they were 
transformed into JM 101 E. coli cells followed by plating onto LB-ampicillin plates. Colonies were tested by 
hybridization with 5'- 32 P-HSA 5 oligonucleotide probe and approx. 10% of them were found to be positive. 
Plasmid DNA was prepared from 5-5 recombinants and they were sequenced by using HSA primer 9. Proper 
junction of PH05 leader sequence and HSA coding sequence was obtained in 2 cases for HSA No 1 and in 3 
30 cases for HSA No 2. These plasmids are called pPT2/HSA No 1 and pPT2/HSA No 2, respectively. 

In these constructions, the HSA gene is cloned in an E. coli plasmid between a yeast promoter + signal 
sequence and a yeast transscriptional terminator. In the next step, this "HSA expression cartridge" should be 
transfered into an E. coii-yeast shuttle vector. 

35 
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Cloning the HSA expression cartridge into pBY200 and pJDB 207 

The yeast-E. coli shuttle vector pBY200 contains both the yeast and E. coli replication origin, an Ap n region 
5 and a Leu2 marker. This plasmid can be cleaved with Hindlll and Xhol enzymes so that the resulting large 
fragment keeps all the above mentioned region, and can serve as a vector to clone the HSA expression 
cartridge obtained from pPT2/HSA by Hindlll and Xhol cleavages (Fig. 8). 

pBY 200 cleavage : 

10 

5 ng of pBY 200 was dissolved in 100 u.l of buffer containing 50 mM NaCI, 10 mM Tris-HCI pH 7.5, 10 mM 
MgCl2 and 2 mM DTT (medium salt buffer) and was treated with 40 units of Hindlll and 60 units of Xhol at 37° C 
for 6 hrs. The large fragment was isolated by gei electrophoresis on a 0.5% agarose gel in TAE buffer followed 
by electroeiution. 

15 

pJDB 207 cleavage : 

Similarly, the yeast-E. coli shuttle vector plasmid was cleaved with Hindlll and Sail restriction enzymes under 
conditions described above for the pBY200 except that 45 units of Sail was used instead of Xhol. The large 
20 vector fragment (Fig. 8) was isolated by electrophoresis and purified from agarose gel by electroeiution. 

pPT2/HSA cleavage : 

5 u,g of pPT2/HSA No 1 or pPT2/HSA No 2 was treated as above in 100 u.l of medium buffer with Hindlll and 
25 Xhol enzymes. The larger fragment was isolated in both cases by electrophoresis on a 0.5% agarose gel 
followed by electroeiution. 

Ligation : 

30 0.2 \ig of Xhol-Hind!ll cleaved pBY 200 was mixed separately with either 0,2 fig of Xhol-Hindlll fragment of 
pPT2/HSA No 1 or 0.2 u.g of Xhol-Hind III fragment of pPT2/HSA No 2 in 10 ul of ligase buffer and 80 units of T4 
DNA ligase was added to both mixtures which were kept at 15'C for 15 hrs. The reaction mixtures were 
transformed into frozen competent E. coli cells (JF 1754) followed by plating on LB-ampicillin plates. Similar 
condition were used for ligation of the Xhol-Hindlll fragment of pPT2/HSA No 1 into pJDB 207 cleaved with 

35 Hindlll and Sail. 

Selection and analysis of pBY2/HSA No 1 and pBY 2/HSA No 2 recombinants 

Colonies grown upon LB-ampicillin plates were picked onto 1.) M9 minimal plate containing 20 u.g/ml 
40 methionine and 20 u,g/ml histidine (but lacking leucine), 2.) LB-tetracycline plate and 3.) a nitrocellulose filter 
placed onto an ampicillin-LB plate. Colonies grown up on the nitrocellulose filter were lysed and hybridized 
with 32 P-labeled HSA 6 oligonucleotide probe. Positive colonies which were tetracycline sensitive on plate 2 
and showed leu complementation on plate 1 (i.e. did not grow on plate 2 but grew up on plate 1) were selected 
and plasmid DNA was prepared from them (approx. 20% of the total colonies obtained on LB-ampicillin plate 
45 showed the expected phenotype on plates 1 -3). Recombinant plasmid DNAs were cleaved with the mixture of 
Xhol and Hind III and the cleavage was checked by electrophoresis on a 0.5% of agarose gel in TBE buffer. 
Upon this double-cieavage, both pBY2/HSA No 1 and pBY2/HSA No 2 gave two fragments with sizes 
corresponding to the size of the starting pBY 200, pPT2/HSA No 1 and pPT2/HSA No 2 fragments, 
respectively. At the same time, Kpnl cleavage resulted in a linearized vector in both cases. In order to control 
50 the structure of pYHSA 221 the recombinant plasmid was cleaved with Xbal resulting in two fragments with 
sizes expected from the physical map (Fig. 8). 

All the plasmid constructions and cloning steps leading to the yeast expression vectors containing the HSA 
gene are summarized in Fig. 9. 

55 Expression of the synthetic HSA gene in recombinant yeast cells 



Transformation of yeast cells and culture conditions for the induction of the PHO 5 promoter : 

The synthetic HSA gene was placed under control of the yeast PHO 5 promoter in a series of manipulations 
described in details above leading to the construction of the yeast-E. coli shuttle plasmid pBY2/HSA No 1 and 
PBY2/HSA No 2, and pYHSA 221 (Fig 8). Yeast cells (LL 20; Leu 2-3, 112, His 3-11, 15: Storm, et. al., ibid) were 
transformed either by the spheropIast-PEG method of Beggs, J.D. (Nature 275, 104, (1978)) or Ito, H. et. al. (J. 
Bacterid. 153, 163 (1983)). 

The recombinant yeast cells were selected on the basis of their His - , Leu* phenotype and tested for the 
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presence of the transforming plasmids by reisolating said plasmids from 10-ml cultures by the method of Holm 
et ai. (Gene 12, 169 (1986)) and analysing their structure by restriction enzyme cleavage and electrophoresis 
on 1o/o agarose gel. The recombinant yeast cells in each case contained transforming expression vector 
plasmids of the proper size and structure. These cells were grown in YNB medium (Difco) containing 2% 
glucose and 0.15 <Vb KH2PO4 to ODeoo of 2.0, harvested and diluted into a low-phosphate YNB medium 5 
(containing 30 mg KH2PO4 per liter, to activate the PHO 5 promoter), and were regrown for 60 hrs before 
harvesting (ODsoo-2.0). The cells were then washed with 0.1 M Na-phosphate buffer, resuspended in one 
hundredth volume of the same buffer containing 1 o/ 0 Triton X-100, 0.1 mM phenylmethyl-sulphonyl fluoride 
(PMSF) and broken by vortexing with glass beads (Sigma, type IV, 250-300 microns). Alternatively, the cells 
were washed and resuspended in 1M sorbitol and incubated with ^-glucuronidase (Boehringer; 1 0/0 solution) 10 
in 100 mM (i-mercaptoethanol at 30° C to produce protoplasts which were then iysed with 1<Yo Triton X-100. 
Cell ex tracts were clarified by centrifugation at 10 000 rpm for 15 min resulting in the so called "periplasmic 
fraction" in which HSA was assayed by the following immunological and electrophoretic methods. 

Micro-ELISA test : 15 

The EUSA plates were coated with antl-HSA-Ab (purified from horse serum - a product of HUMAN, 
Hungary - on Protein A-Sepharose 4B columns) and saturated with 0.5 0/0 Gelatine (Sigma). 100 jil of clarified 
yeast ceil extract was layered in appropriate dilutions onto the coated wells and incubated for 1 hour at 37° C. 
The HSA-anti-HSA-Ab binding was monitored by a conventional color reaction by using biotinylated 20 
horseradish peroxidase-streptavidine complex, H2O2 as substrate and ortho-phenylene diamine as developer. 

Serial dilutions ranging from 2 u,g to 1 5 jig of purified HSA (Reanal, Hungary) per well were used as reference 
for calibration. A thousand-fold dilution of human serum was used as a positive control, gelatine-coated wells 
as well as extracts from non-recombinant yeast cells (LL 20, processed as for the HSA assay) served as 
negative controls in micro-ELISA tests. The color reactions were evaluated in a microplate reader of 25 
Cambridge Life Sciences Ltd., UK. 

Immunoprecipitation of 35 S-Methionine-labeled proteins: 

The whole-cell proteins of the recombinant yeast cells were labeled for 16 hours at 30°C with 30 
35 S-methionine by culturing the yeast cells in "low-methionine, low-phosphate" YNB medium containing 40 
\iC\ of 35 S-methionine per milliliter. 

20 uJ of horse anti-HSA serum was added to 10 8 cell equivalents of clarified cell lysates (0.5 ml) for 90 
minutes at 4°C in 0.1 M phosphate buffer, pH 8.0. The immunoprecipitates were adsorbed onto 1-ml protein 
A-Sepharose (Pharmacia) for 90 minutes at 4°C and washed. Immunoprecipitated proteins were eluted from 35 
the protein A-Sepharose beads (Conner, G.E. er. al. t J. Exp. Med. 156, 1475, 1982) and resolved on a 15 
percent SDS-polyacrylamide gel and fluorographed. Clarified extracts obtained from 35 S-methionine-labeled 
non-recombinant LL 20 cells and that of a recombinant yeast strain expressing the hepatitis B surface antigen 
(HBsAg) were used as controls. 

40 

Results : 

The yeast cells transformed with the plasmids pBY2/HSA No 1 and pYHSA221 exhibited active production 
of the HSA protein which could be readily detected by EUSA as well as precipitated with specific antiserum 
directed against HSA. 45 

According to the micro-ELISA test the proportion of the of HSA ranged between 3-8% of total cell protein. 

Fig. 10 shows the fluorograph of the 35 S-methionine-labeled proteins obtained from the periplasmic fraction 
and immunoprecipitated with goat anti-HSA serum and resolved in SDS-polyacrylamide gel. 

Track M - 14 C-protein molecular weight marker mix (BRL); track A - recombinant 35 S-HBsAg precipitated 
with anti HBsAg antibodies; track B - labeled HSA produced in the recombinant yeast precipitated with 50 
anti-HSA serum; tracks C and D demonstrate the lack of cross-immunreactions of anti-HSA serum with 
HBsAg-containing yeast lysate and that of anti-HBsAg serum with the HSA lysate, respectively. 

The electrophoretic mobility of the immunoprecipitated HSA was approximately the same as that of the 67 
kd labeled protein marker. This result indicates that the majority of the HSA protein secreted into the 
periplasmic space by the expression vector construction involving the entire signal peptide of the PHO 5 gene 55 
is correctly processed yielding a protein product with the size of the mature (natural) HSA. 

Two independent immunoblotting experiments (western blots) revealed a protein of the same molecular 
mass. 

Laboratory-scale purification of expressed HSA from yeast cultures 60 

500-ml culture of yeast cells transformed with either pBY2/HSA or pYHSA 221 was grown at 30° C to 
ODeoo = 2.0 (usually 24-28 hours) in 0.67% YNB medium (DIFCO) containing 0.1 50/0 (w/v) KH2PO2 20 
mg/liter L-histidine and 2<Vb (w/v) glucose. The cells were collected by centrifugation at 2000 x g for 5 min and 
resuspended in 10 liters of 0.67% (w/v) YNB medium containing 30 mg/liter KH2PO4, 0.1% (w/v) KCI, 20 65 
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mg/liter L-histidine and 2o/o (w/v) glucose. Following 60 hours of culture growth (to ODsoo 1.8-2.0) the cells 
were harvested by centrifugation (4000 x g, 5 min) washed twice with ice-cold distilled water and resuspended 
in 200 ml O.Wa Triton X-100, 0.5 M NaCl, 20 mM Tris/HCI. pH 7.5, 100 mM P-mercaptoethanol and 1 mM PMSF. 
The cells were homogenized for 60 sec. in pre-cooled glass-bead cell homogenizer (Model: Braun MSK). The 

5 cell extract was clarified by high-speed centrifugation (at 20 000 x g, 4°C, for 30 min). The pH of the clarified 
lysate was adjusted (by dropwise addition of 1 M HCI) to 4.8-5.0, then saturated solution of (NH 4 >2S04 was 
added to a final concentration of 60<Vb of saturation. The mixture was stirred for 2 hrs in ice-water bath, then 
centrifuged in Sorvall RC-5C centrifuge for 30 min at 18 000 rpm (2°C). The pellet was dissolved in 100 ml of 50 
mM Bis-Tris buffer, pH 6.5, followed by dialysis overnight against 20 volumes of the same buffer. The dialysed 

10 lysate was centrifuged for 30 min, (18 000 rpm, 2°C) and the clear supernatant was applied onto a Superose 
MONO Q HR 5/5 FPLC column (Pharmacia) equilibrated with the same buffer. 

The anion exchange chromatography as well as all successive chromatographic purification steps were 
performed on a Pharmacia FPLC system. 
After a short linear gradient of NaCl (0.0-0.1 M) proteins eluting with 0.1 M NaCl (isocratic elution) were 

15 collected and dialysed against 0.05 M Na-phosphate buffer, pH 7.5. 7 his fraction was subjected to hydrophobic 
interaction chromatography on Alkyl-Superose HR 5/5 column. 

Solid ammonium sulfate was added to the above dialysed fraction, the final concentration to be adjusted to 
2.0 M t and the sample was loaded onto Alkyl-Superose HR 5/5 column equilibrated with 2 M (NH4teS04 in 50 
mM Na-phosphate buffer. Bound proteins were eluted with linear descending-concentration gradient of 

20 (NH4)2S04 The HSA-containing fraction was eluted at about 1.2 M {NH 4 )2S04, which was monitored by 
SDS-PAGE of the eluted fractions. 

Gel-filtration . 

25 The "HSA" fraction from the previous step was concentrated by ultrafiltration in an Amicon stirred cell (filter: 
PM-30), then loaded onto Superose 12 HR 10/30 column equilibrated with 50 mM Na-phosphate buffer, pH 7.5 
containing 0.15 M NaCl. The first large peak upon gel-filtration contained highly purified monomeric HSA as 
was tested by SDS-PAGE, according to Laemmli U.K., Nature 227. 680 (1970). Additional molecular analyses 
included PAGE at "native" conditions, IEF and limited CNBr-cleavage (Barsh, G.S., and Byers. P.H., 1981. 

30 Proc. Natl. Acad. Sci. USA 78:5142-5146). 

Molecular properties of HSA purified from recombinant yeast 

The HSA purified from yeast cells was shown to run as a single 68-kilodalton protein band in 
35 SDS-polyacrylamide gels indicating that it has the same molecular mass as the mature natural HSA. 

Electrophoresis in native conditions (carried out on PHARMACIA'S 10-15<Vb PHAST GELS according to the 
manufacturer's instructions) indicated that the behaviour of HSA produced by yeast was similar to that of 
natural mature HSA and had also similar tendency to form double, triple and multimer complexes probably by 
random formation of intermolecular -S-S- bridges. 
40 The absence of glycosylation in HSA produced in yeast was proven by Con A-Sepharose chromatography: 
500 u.g of a partially purified protein extract obtained from HSA-producing yeast cells was allowed to bind to 
750 uJ of swollen Con-A-Sepharose (Pharmacia) in 1 .5 ml of buffer containing 20 mM Tris-HCI. pH 7.4, and 0.5 
M NaCL The suspension was slowly shaken overnight at 4°C and the Con A-Sepharose was separated from 
the buffer (containing the unbound proteins) by centrifugation at 12 000 x g for 10 min. The Con A-Sepharose 
45 gel was then washed with 100 ml of same buffer by filtration through a 25-mm circle of Whatman GF/A filter. 
The bound proteins were eluted by a buffer containing 20 mM Tris-HCI, pH 6.8, 0.25 M a-D-methytmannoside 
(Serva) and 0.25 M NaCl. 

Both the unbound and bound (to Con A-Sepharose) protein fractions were dialysed against 10 mM Tris-HCI 
(pH 6.8) and subjected to 1) SDS-PAGE according to Laemmli, U.K. (Nature 227, 680 (1970)), and 2) 
50 ELISA-test in order to control the presence of HSA. 

A similar approach was applied for a fraction of purified HSA. 

In each case, the SDS-PAGE and ELISA tests revealed the absence in the fraction of Con A-binding proteins 
of any 68 kd protein as well as any proteins showing immunological reactions with anti-HSA-Ab. The HSA was 
quantitatively recovered from the protein fraction which did not bind to Con A-Sepharose upon application of 
55 the sample. 

The results strongly indicate the absence of glycosylation in the molecules of HSA produced in yeast. 

Prior to peptide mapping by limited proteolysis the samples were heat-denatured in the presence of 0.5°/o 
(w/v) SDS without addition of a reducing agent, and subjected to enzymatic digestions for 10 to 20 minutes. 
Subtilisin, thermolysin, trypsin and papain were used. The cleavage by CNBr was carried out as described by 
60 Barsh et al. (ibid). 

Fig 12 shows the CNBr cleavage pattern of natural HSA (A and C) (purified as described above from 
commercial sources) and yeast-produced HSA (B and D), as demonstrated by SDS-PAGE separation of the 
cleaved polypeptides. After digestion, SDS and p-mercaptoethanol were added to concentrations (w/v) 2.5<>/o 
and 100/0, respectively. The samples were loaded onto 8-250/o gradient PHAST GEL (PHARMACIA) and the 
65 electrophoresis was carried out in a PHARMACIA PHAST SYSTEM according to the manufacturer's 
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instructions. 

The results obtained indicate that the HSA purified from recombinant yeast showed cleavage patterns by 
proteolytic enzymes and cyanogen bromide similar to those of natural HSA (with a note that the 
yeast-produced HSA was less accessible to papain digestion under the conditions used than the natural HSA). 

A sample of the HSA purified from yeast was subjected to N-terminal sequencing on an Applied Biosystems 
Model 470 A gas phase sequencer. The result of the sequencing did not reveal any other amino acid residues 
than those expected. 

Construction of plasmid vectors promoting the secretion of the HSA into the culture medium 

The experirnentai strategy for the construction of new expression-secretion vectors was based on the 
finding that prepro HSA was correctly processed in vitro by the yeast KEX2 endopeptidase yielding mature 
Asp-Ala-HSA (Bathurst, i.C. et al., 1987, Science 235, 348-350). The natural N-terminal HSA prepro-leader 
peptide was evaluated as a sequence capable of promoting the secretion of HSA from the recombinant yeast. 

The sequence of a 103-mer synthetic DNA fragment coding for the HSA preprox* -leader peptide was 
designed as follows: 



M K W V T F 
Met Lys Trp Val Thr Phe 
GATCAAAAACACTAAAATATAATCAAA ATG AAG TGG GTT ACT TTC 



ISLLFLF.SSAYSR 
He Ser Leu Leu Phe Leu Phe Ser Ser Ala Tyr Ser Arg 
ATC TCT TTG TTG TTC TTG TTC TCT TCT GCT TAC TCT AGA 



G V F K X R 
Gly Val Phe Lys Arg 
GGT GTT TTC AAG AGG CCT G 

Stu 1 



The sequence (27 nucleotides) upstream of the ATG codon was designed to be closely homologous to the 
downstream end of a strong constitutive yeast promoter, i.e. that of the gene coding for glyceraldehyde- 
3-phosphate dehydrogenase (GAPDH; Holland, J.P. and Holland, M.J., 1979, J. Biol. Chem. 254, 9839-9845). 
Other characteristic features of the above DNA sequence include the usage of the most frequent yeast 
codons, as well as an "ideal" KEX2 cleavage site {K-R A ; Kurjan, J., Hershkovitz I., 1982. Cell 30, 933-943) coded 
by the AAG AGG codons which - according to this design -coincides with a Stu I restriction endonuclease 
digestion site. 

1. Construction of pHSA-T plasmid containing the gene for HSA No 1 and the yeast His3 transcriptional 
terminator 

The 1.8 Kb Hind Ill-Sacl fragment from pHSA No. 1 (i.e. the gene coding for HSA) was cloned into 
pGB3-229TK° (Fig. 3b) at the Hind III and Sad sites. Prior to this cloning step the Pstl site (located in the His3 
terminator region) was deleted, since it would become double after the insertion of the HSA gene. 

0.5 \lq of pGB3-229TK° was treated with 10 units of Pstl in 20 ui of medium salt buffer at 37°C for 2 hrs. After 
phenol extraction and ethanol precipitation the pellet was dissolved in 50 uJ of Klenow buffer containing 0.1 
mM dNTP and 2.5 units of Klenow polymerase, and the reaction mixture was kept at room temperature for 40 
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min.The reaction mixture was then phenol extracted and ethanol precipitated. The DNA pellet was dissolved in 
100 uJ of ligase buffer and 50 units of T4 DNA ligase was added. The ligase reaction was carried out at 15°C for 
15 hours, followed by transformation of E. coli JM 109. Plasmids (from about 30% of ail transformants) 
containing no Pst! site (designated as pGB3T) were selected and used for the insertion of the HSA gene as 
5 follows: 

a) 2 u.g of pGB3T was digested with 10 units of Sad in low salt buffer, at 37°C for 4 hrs, in a final volume 
of 20 u.l. The buffer was then adjusted to be optimal for Hind III digestion, 10 units of Hind III was added 
(final volume 40 uJ) and the treatment was carried out for additionally 4 hrs at 37° C. The 2.06 Kb vector 
was separated in 0.8<Yo agarose gel. 
10 b) 5 jig of pHSA No. 1 was digested by Sad and Hindlll as described above. The 1 .8 Kb HSA fragment 

was isolated from 0.8<Vo agarose gel. 

c) The ligation of the 2.06 Kb pGB3T vector and the 1 .8 Kb HSA insert was carried out in 20 \i\ ligation 
mix (containing 80 units of T 4 DNA ligase) at 15°C for 16 hrs. E. coli JM109 cells (Yanisch-Perron, C. et. 
al.. ibid) were transformed. Plasmids isolated from tetracycline sensitive transformants were tested for 
15 restriction enzyme digestion pattern, viz. by double-digestion with Sail and Xhol. 

The resulting plasmid was designated as pHSA-T (Fig. 11). 

2. Insertion of a strong constitutive promoter and the artificial prepro-leader sequence into pHSA-T; 
construction of the plasmid pGprepro*HSA-T (Fig. 11). — 

20 

a) Cloning of the artificial prepro '-leader-coding sequence downstream of the GAPDH promoter. 

0.5 u,g of M13/GPD-3 (RF) DNA (Bitter, G.A. and Egan, K.M. (1984): Gene 32. 263-274) was treated with 5 U 
25 of EcoRI in medium salt buffer for 2 hrs at 37° C followed by digestion by 5 U of BamHI for additional 2 hrs at 
37°C in high sait buffer. The digestions were terminated by phenol extraction and ethanol precipitation. The 
DNA pellet was washed with 700/o ethanol, dried under vacuum and dissolved in 5 u.1 H2O for ligase reaction 
with the artificial HSA prepro '-leader. 
The ligation mixture contained the EcoRI-BamHI-treated M13/GPD-3 DNA, 1 pmole of the synthetic double 
30 -stranded 103-mer DNA fragment (coding for prepro* -leader) and 80 U of T 4 DNA ligase, in 15 uJ ligase buffer. 
The ligase reaction was carried out at 15° C for 16 hrs, after which E. coli JM 109 was transformed. 

The phage transformants were screened for the insertion of the 103-mer BamHI* -EcoRI prepro '-leader- 
coding fragment by the dideoxynucleotide sequencing method. 
The transformants containing the HSA prepro '-leader-coding sequence placed behind the GAPDH 
35 promoter were named MI3/Gprepro*KR (Fig. 11.) ("KR" indicates Lys-Arg and G denotes the GAPDH 
promoter). 

Cloning the HSA gene behind the GAPDH promoter-prepro* sequence fusion: construction of 
pGprepro*HSA-T 

40 

a) pHSA-T was digested with Pstl (0.5 u.g DNA, 5 U Pstl, 4 hrs at 37° C) and the cleaved 3'-protruding 
end was made blunt by treatment with the Klenow polymerase. The linearized plasmid was then further 
cleaved with 5 U of Hind III (4 hrs at 37° C), followed by phenol extraction and ethanol precipitation. 

b) The GAPDH promoter + artificial prepro* -leader-coding sequence was isolated from M13/G 
45 prepro'KR by simultaneous digestion of 5 u.g plasmid DNA with 20 U of Hind III and 20 U of Stu 1 (5 hrs at 

37°C in medium-saJt buffer). The 0.75 Kb pro moter-f- prepro* fragment was isolated by electrophoresis in 
1% agarose gel. electroeluted, phenol extracted and ethanol precipitated. 

c) The purified promoter + prepro* fragment was ligated into the Pstl (blunt)-Hind Ill-treated vector 
pHSA-T (in 20 uJ mix, at IS^C for 16 hrs) followed by transformation of E. coli JM 101. The resulting 

50 plasmid, pGprepro * HSA-T (Rg. 1 1 ) was tested by mapping restriciton endonuclease cleavage sites. 

Construction of the yeast - E. coli shuttle vector containing the prepro'-HSA-expression - secretion cassette 

The prepro*-HSA expression cassette was isolated from pGprepro 'HSA-T (by Hindlll + Xhol digestion; 2 
55 u,g DNA f in 20 u.l high-salt buffer, 10 U of Hindlll and Xhol each at 37°C for 10 hrs followed by electrophoretic 
separation on a 0.8<Vo agarose gel, electrocution, phenol extraction and ethanol precipitation). The 
Hindlll -Xhol fragment was then ligated into pJDB207 (between Hindlll and Sail sites) resulting in 
YEp/Gprepro* HSA (Fig 11) which was used to transform yeast LL20. Yeast transformants were selected on 
YNB-agar plates (lacking leucine). The expression and secretion of HSA was tested in shake-flask cultures as 
60 described below. 

Expression and secretion of HSA by the recombinant y east transformed with pYEprepro*HSA 
(YEprepro*-HSA)T 

65 A single colony of the yeast YEprepro'HSA was inoculated into 10 ml of YNB medium containing 2°/o 
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glucose and 200 jig/ml histidine and the cells were grown overnight at 30° C with continuous shaking. 1 ml of 
the overnight culture was diluted into 200 ml of the above medium and further grown to OD 60 o = 2.0. The cells 
were precipitated by centrifugation (6000 r.p.m.; 4°C; 15 min), the supernatant was saved and concentrated 10 
times by using an Amicon stirred ultrafiltration cell with PM-30 filter. The concentrated cell medium was then 
dialysed overnight against 20 mM Tris/glycine, pH 8,3, 1 mM EDTA, 5 mM p-mercaptoethanol and 0.01o/o SDS. 
Secreted HSA was assessed by quantitative micro-ELISA as described above. 

It was found that at least 3000 ag HSA per 100 ml culture medium was produced by the yeast cells 
YEprepro*HSA. 

The secreted HSA was subjected to SDS-polyacrylamide gel-electrophoresis followed by immunoblotting 
and staining by conventional methods. 

It was shown (Fig 13) that, although mature HSA (68 Kd) was the major product observed in the culture 
medium, a fragment of HSA with a molecular mass of 46-48 Kd was also detected representing approximately 
1/3 of the total HSA produced. The 68 Kd mature HSA could be readily purified by a series of chromatographic 
steps and gelfiitration (on Superose 12 HR 1 °/30) as previously described in connection with Laboratory-scale 
purification of expressed HSA from yeast cultures. 



Claims 

20 

1. A structural gene coding for authentic human serum albumin, characterized by a nucleotide 
sequence wherein the codons have been selected with regard to a non-human host chosen for 
expression of authentic human serum albumin, whereby the selection of the codons has been effected so 

that, 2S 
in the first instance, the codons most frequently used by the chosen non-human host were selected, and 
in the second instance, the codons used by the chosen non-human host in the second or third place were 
selected, 

to avoid the appearance of such restriction sites which are to be used during the assembly of the gene, 
to create one unique cleavage site for a specific enzyme, and ' 30 

to eliminate 8-base-pairs long or longer palindromes within such parts of the gene which are to be 
chemically synthesized and cloned. 

2. A structural gene according to claim 1, characterized by a nucleotide sequence wherein the codons 
have been selected with regard to a yeast host. 

3. A structural gene according to claim 2, characterized by the nucleotide sequence 35 
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4. A structural gene according to any one of claims 1-3, supplemented by an upstream nucleotide 
sequence coding for methionine. 

5. A structural gene according to any one of claims 1-4, extended by an upstream nucleotide sequence 
in which the codons have been selected with regard to a non-human host and which codes for the amino 
acid sequence 

Met-Lys-Trp-Val-Thr-Phe lle-Ser-Leu-Phe-Leu-Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Gyl-Val-Phe-Lys-Arg 

6. A structural gene according to claim 5, characterized in that the codons have been selected with 
regard to a yeast host. 

7. A structural gene according to claim 6, characterized in that the nucleotide sequence which codes 
for the amino acid sequence is 



Met-Lys-Trp-Val-Thr-Phe-Ile-Ser-Leu-Leu-Phe-Leu-Phe- 
-Ser-Ser-Ala-Tyr-Ser-Arg-Gly-Val-Phe-Lys-Arg 



8. A recombinant DNA molecule comprising a gene according to any one of claims 1-7, inserted into a 
vector. 

9. A host transformed with a recombinant DNA molecule according to claim 8. 

10. A method of producing a structural gene coding for authentic human serum albumin, characterized 
by the following steps, 

a) designing the nucleotide sequence coding for authentic human serum albumin by selecting 
codons with regard to a non-human host chosen for expression of authentic human serum albumin, 
whereby the selection of the codons is effected so that 

in the first instance, codons most frequently used by the chosen non-human host are selected, and 
in the second instance, codons used by the chosen non-human host in the second or third place are 
selected, 

to avoid the appearance of such restriction sites which are used during the assembly of the gene, 
to create one unique cleavage site between a5'-fragment and the rest of the whole gene, and 
to eliminate 8-base-pairs long or longer palindromes within oligonucleotide subunits of fragments to 
be cloned, 

b) dividing the designed nucleotide sequence into a 5'-fragment to be chemically synthesized and 
a few fragments to be cloned so that joining points between said few fragments will be at suitably 
located G-C dlnucleotide sequences, 

c) modifying said designed few fragments of b} by supplementing the designed nucleotide 
sequences thereof with an extra nucleotide sequence GGTAC at the 5'-terminus, except for the 
fragment to be joined to the 5'-fragment of b), and further dividing said few fragments into subunits 
having a 3' -nucleotide G, which subunits in turn are individually supplemented with an extra 
nucleotide sequence GGCC; 

d) individually chemically synthesizing the modified supplemented subunits of c) in single- 
stranded form In per se known manner and chemically synthesizing the 5' fragment of b) in 
double-stranded form in per se known manner; 

e) consecutively cloning the synthesized subunits of d) starting from the 5'-terminus of the 
modified supplemented few fragments of c) into a few individual recombinant vectors in per se 
known manner, with the aid of adapters and enzymatical filiing-in reaction, to form cloned 
double-stranded fragments of the gene, which correspond to the modified supplemented few 
fragments of c), 

f) assembling the cloned double-stranded fragments of e) by cleaving the few recombinant 
vectors of e}, in pairs, with the enzyme Kpnl and the enzyme Apal, respectively, - one at the created 
5'-terminal Kpnl restriction site, and the other at the created 3' terminal Apal restriction site, - to form 
sticky ends which are made blunt ends by a single-strand-specific enzyme in per se known 
manner - leaving an end-nucleotide C and an end-nucleotide G, respectively - followed by cleavage 
with another restriction enzyme having a cleavage site which is unique in both of the recombinant 
vectors of the pair in question, 

to form on the one hand a linear vector containing a cloned fragment of the gene and, on the other 
hand, a cleaved-off fragment of the gene, which two last-mentioned fragments are, in per se known 
manner, enzymatically joined at the blunt ends -a dinucleotide G-C which is included in the 
nucleotide sequence of the gene, being formed at the joining point - 

to obtain a recombinant vector which finally includes all the few designed fragments of b) in 
double-stranded form, and 

g) supplementing the recombinant vector obtained in f) with the chemically synthesized 5' 
fragment of d) to form the whole structural gene coding for authentic human serum albumin. 
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1 1. A method of producing a structural gene according to claim 10, characterized in that 
in a), the chosen non-human host is yeast, 
in b), the designed nucleotide sequence is 
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GTT TCT GAC AGA GTT ACT AAG 
GTT AAC AGA AGA CCA TGT TTC 
GAA ACT TAG GTC CCA AAG GAA 
ACT TTC CAC GCC GAC ATC TGT 
AGA CAA ATC AAG AAG CAA ACT 
AAG CAC AAG CCA AAG GCT ACT 
GTT ATG GAC GAC TTC GCT GCT 
AAG GCT GAC GAC AAG GAA ACT 
AAG AAG TTG GTT GCT GCT TCT 
TAA TAG 



TGT TGT ACT GAA TCT TTG 
TCT GCC TTG GAA GTT GAC 
TTT AAC GCT GAA ACT TTC 
ACC TTG TCC GAA AAG GAA 
GCT TTG GTT GAA TTG GTT 
AAG GAA CAA TTG AAG GCT 
TTC GTT GAA AAG TGT TGT 
TGT TTC GCT GAA GAA GGT 
CAA GCT GCT TTG GGT TTG 



in which the arrows show the dividing points between the first 5' fragment to be chemically 
synthesized and four fragments to be cloned, 
in c), the supplemented single-stranded subunits of the modified fragments of b) are 

TAGGTGAAGAAAACTTCAAGGCTTTGGTTTTGATTGCTTTCGCTCAATACTTG- 
CAACAATGTCCATTCGAAGGGCC 

ACCACGTCAAGTTGGTCAACGAAGTTACTGAATTTGCTAAGACCTGTGTTGCT- 
GACGAATCTGCTGAAAACTGGGCC 

TGACAAGTCCTTGCACACTTTGTTCGGTGACAAGTTGTGTACTGTTGC- 
TACTTTGAGAGAAACTTACGGTGGGCC 

AAATGGCTGACTGTTGTGCTAAACAGGAACCAGAAAGAAACGAATGTTTCTTA- 
CAACACAAGGACGGGCC 

ACAACCCAAACTTGCCAAGATTGGTTAGACCAGAAGTCGACGTTATGTG- 
TACTGCTTTCCACGACAACGAAGGGCC 

AGACTTTCTTGAAGAAGTACTTGTACGAAATCGCCAGAAGACACCCATACTTC- 
TACGCTCCAGAATTGTTGTTCTTCGGGCC 

GGTACCTAAGAGATACAAGGCTGCTTTCACTGAATGTTGTCAAGCTGCCGA- 
CAAGGCTGCTTGTTTGTTGGGCC 

CCAAAGTTGGACGAATTGAGAGACGAAGGTAAGGCTTCTTCCGCTAAGCAAA- 
GATTGAAGTGTGCTTCCTTGGGCC 

CAAAAGTTCGGTGAAAGAGCCTTCAAGGCCTGGGCTGTTGCTAGATTGTCT- 
CAAAGATTCCCAAAGGCTGGGCC 

AATTTGCTGAAGTTTCTAAGTTGGTTACTGACTTGACTAAGGTTCACACT- 
GAATGTTGTCACGGTGACTTGGGCC 

TTGGAATGTGCTGACGACAGAGCTGACTTGGCTAAGTATATCTGTGAAAAC- 
CAAG ACTCTATCTCTTCTA AG G G CC 

TTGAAGGAATGTTGTGAAAAGCCATTGTTGGAAAAGTCTCACTGTATCGCT- 
GAAGTTGAAAACGACGAAATGGGCC 

GGTACCCAGCTGACTTGCCATCnTGGCTGCTGACTTCGTTGAATCTAAG- 
GACGTTTGTAAGAACTACGCTGAAGGGCC 

CTAAGGACGTTTTCTTGGGTATGTTCTTGTACGAATACGCTAGAAGACACCCA- 
GACTACTCCGTTGTTTTGTTGTTGGGCC 
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AGATTGGCTAAGACTTACGAAACTACTTTGGAAAAGTGTTGTGCTGCTGCT- 
GACCCACACGAATGTTACGCTAAGGGC 

GTTTTCGACGAATTTAAGCCATTGGTTGAAGAACCACAAAACTTGATTAAG- 
CAAAACTGTGAATTGTTCAAGGGCC 

CAATTGGGTGAATACAAGTTCCAAAACGCTTTGTTGGTTAGATACACTAA- 
G A AG GTTCCAC A AGTCTCC ACTCC AACTTTG G G CC 

GTTG AAGTCTCTAG AAACTTG G GTAAG GTTG GTTCTAAGTGTTGTA AGC A CC- 
CAGAAGCTAAGAGAATGGGCC 

GGTACCCATGTGCTGAAGACTACTTGTCtGTTGTTTTGAACCAAT- 
TATGTGTTTTGCACGAAAAGGGCC 

ACTCCAGTTTCTGACAGAGTTACfAAGTGTTGfACTGAATCTtTGGTTAACA- 
GAAGACCATGTTTCTCTGGGCC 

CCTTGGAAGTTGACGAAACTTACGTCCCAAAGGAATTfAACGCTGAAACTTT- 
CACTTTCCACGCCGACATCTGGGCC 

TACCTTGTCCGAAAAGGAAAGACAAATCAAGAAGCAAACTGGTTTGGTTGAA TTGGTTAAGCA- 
CAAGCCAAAGGGCC 

GCTACTAAGGAACAATTGAAGGCTGTTATGGAGGACTtCGCTTTCGTT- 
G AAA AGTGTTGTAAG G CTG ACG G G CC 

ACA AGG AAACTTGTTTCG CTGAAGAAG GTAAG AAGTTGGTTG CTG CTTCT- 
C AAG CTG CTTTr^G GTTTGT AAT AG GG CC 

in e), the synLiiesized subunits of d) are consecutively cloned into four individual E. eoli vectors 
with the aid of the adapters 



Apa* BcoRI 

CGGACGGCGACGGCGACGGGGACCG 
CCCGGGCCTGCCGCTGCCGCTGCGGCTGGCTTAA 



Apal EcoRI 

CGAGTATGCGACAGCTGG 
CCCGGGCTCATACGCTGTCGACCTTAA 



in f), the single-strand-specific enzyme is Klehow polymerase. 
12. A method of producing authentic human serum albumin by propagating a host transformed with a 
vector comprising a recombinant DNA sequence under expression and optionally secretion conditions 
and isolating the expressed and optionally secreted protein product, characterized by utilizing a host 
transformed with a vector comprising the structural gene according to any one of claims 1-7, and isolating 
authentic human serum albumin. 

T3; An authentic human serum albumin, characterized in that it results from the method of claim 1 2; 
14. A pharmaceutical composition comprising authentic human serum albumin according to claim 13 t in 
admixture with a pharmaceutical^ acceptable carrier and/or diluent. 
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FIG. 5 
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(8-kb EcoRI fragment: .PH05, P.H03 locus) 



FIG. 9 
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